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SYSTEMS AND METHODS FOR PREDICTING OLIGONUCLEOTIDE 
MELTING TEMPERATURES (T m s) 

CROSS-REFERENCES TO RELATED APPLICATIONS 

5 This application is a continuation-in-part of U.S. Application Serial No. 

09/796,988, filed February 28, 2001, which is a continuation-in-part of U.S. Application 
Serial No. 09/724,959, filed November 28, 2000; and which is a continuation-in-part of U.S. 
Application Serial No. 09/640,953, filed August 16, 2000; which is a continuation of U.S. 
Application Serial No. 09/054,832, filed April 3, 1998; and which is a continuation-in-part of 
10 U.S. Application Serial No. 09/431,385, filed November 1, 1999; which is a continuation of 
U.S. Application Serial No. 09/054,830, filed April 3, 1998, now U.S. Patent No. 6,127,121 ; 
the disclosures of each being incorporated herein by reference. 

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER 
1 5 FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT 

Not applicable 

BACKGROUND OF THE INVENTION 

The present invention generally relates to computer software systems and 
20 methods useful in the field of molecular biology relating to the use of oligonucleotides as 
probes and primers in liquid, solid and mixed phase assays. More particularly, the present 
invention relates to computer software systems and methods useful for predicting the melting 
temperatures (T m s) and other characteristics of oligonucleotides, particularly oligonucleotides 
including modified bases to improve the hybridization properties and discriminatory abilities 
25 of oligonucleotides that are used in arrays and*as probes and primers. 

Many techniques currently in use in molecular biology utilize oligonucleotides 
as probes and/or primers. It is often advantageous, in the practice of these techniques, to be 
able to distinguish between two or more sequences which are related but which differ by one 
or more nucleotides. For example, many mutations of clinical significance differ by only a 
30 single nucleotide from the wild-type sequence. Polymorphisms in mammalian genomes are 
also often characterized by sequence differences of one or a few nucleotides. The ability to 
make such a distinction is known as mismatch discrimination. In practical terms, mismatch 
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discrimination describes the property by which a defined sequence oligonucleotide, at a given 
stringency, hybridizes strongly (one manifestation of which is that the hybrids have a high 
melting temperature) to a target sequence with which it is complementary along its entire 
length (a perfect hybrid or perfect match), but hybridizes detectably more weakly to a target 
5 sequence that is non-complementary to the sequence of the oligonucleotide at one or a few 
nucleotides (a mismatch). The differences in hybridization strength are such that a particular 
stringency can be selected at which a perfect match is detectable as a hybrid and a mismatch 
fails to form a hybrid. 

In a nucleic acid duplex, each base pair contributes to stability. Hence, the 

10 shorter the duplex, the greater the relative contribution of each individual base pair to the 
stability of the duplex. As a result, the difference in stability between a perfect match and a 
mismatch will be greater for shorter oligonucleotides. However, short oligonucleotides 
hybridize weakly, even to a perfectly complementary sequence, and thus must be hybridized 
under conditions of reduced stringency. Thus, the potential discriminatory power of short 

1 5 oligonucleotides cannot be easily realized except under conditions of low stringency. 

What is needed in the art are new methods for mismatch discrimination, 
particularly for single-nucleotide mismatches, under conditions of high stringency; for 
example, at the elevated temperatures characteristic of most nucleic acid amplification 
reactions. What is also needed in the art are new systems and methods for analyzing and 

20 predicting properties, such as T m , and other characteristics of nucleotide sequences. 

Surprisingly, the present invention provides such systems and methods, along with new 
reagents and compositions which can be used in the methods. 



25 BRIEF SUMMARY OF THE INVENTION 

The present invention provides a number of modified oligonucleotides found 
to have exceptional properties and usefulness in a variety of assays. Accordingly, the present 
invention also provides systems and methods for using the modified oligonucleotides 
described herein, including computer software systems and methods. 
30 In one aspect, the present invention provides modified oligonucleotides having 

at least two bases selected from unsubstituted and 3-substituted pyrazolo[3,4-d]pyrimidine 
bases. In preferred embodiments, the oligonucleotides having modified bases will further 
comprise other moieties such as detectable labels, fluorescence and chemiluminescence 
quenchers and/or minor groove binders and/or other types of modified bases or base analogs. 
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In another aspect, the present invention provides modified oligonucleotides 
having at least one 5-substituted pyrimidine base and at least one 3-substituted pyrazolo[3,4- 
d]pyrimidine base. In preferred embodiments, these modified oligonucleotides will further 
comprise other moieties (as above) such as detectable labels, fluorescence and 
5 chemiluminescence quenchers and/or minor groove binders. 

In yet another aspect, the present invention provides methods for 
distinguishing polynucleotides having related sequences. 

In still another aspect, the present invention provides methods for detecting the 
presence of a target sequence in a polynucleotide. 
10 In still other aspects, the present invention provides methods for primer 

extension, and methods for determining the nucleotide sequence of a polynucleotide. 

In related aspects, the present invention provides methods for examining gene 
expression in a cell, and methods for identifying a mutation or polymorphism in a target 
sequence of a gene of interest. 
15 In still another aspect, the present invention provides a number of modified 

bases that are useful in preparing modified oligonucleotides for the methods described herein 
and other conventional assays and methods. 

In yet another aspect, the present invention provides modified oligonucleotide 
arrays wherein the array members have T m s within about 1-2 °C and lengths within 1-2 bases 
20 from each other. Methods are also provided for determining sequences of the array members. 

In yet a further aspect, the present invention provides software and computer 
systems and methods for determining the T m s of nucleotide sequences, and in particular 
oligonucleotides having modified bases, using nearest neighbor thermodynamic parameters 
as described herein. 

25 Reference to the remaining portions of the specification, including the 

drawings and claims, will realize other features and advantages of the present invention. 
Further features and advantages of the present invention, as well as the structure and 
operation of various embodiments of the present invention, are described in detail below with 
respect to the accompanying drawings. In the drawings, like reference numbers indicate 

30 identical or functionally similar elements. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figures 1 A and IB provide structures for several modified bases and their 
abbreviations. The wavy line is used to denote the position of an attached sugar moiety 
(unprotected, protected, activated and the like); 
5 Figure 2 is a graph illustrating the balancing of the T m of 8-mer GC-rich and 

AT-rich probes with different combinations of MGB, PPPA and PU; 

Figure 3 is a chart which illustrates one advantage achieved through the use of 
PPPA and PPG in MGB-modified oligonucleotide probes. As seen in the figure, the 
modified bases allow shortening of the probe that shows increased mismatch (hscrimination 
10 in real-time PCR. A is PPPA and G is PPG; 

Figure 4 illustrates an Invader™ assay in which the modified oligonucleotides 
of the invention can be used; 

Figure 5 illustrates a comparison of Invader™ probe performance with 
different numbers of PPGs; 
1 5 Figure 6 illustrates a general overview of a web-based oligonucleotide probe 

design system according to an embodiment of the present invention; and 

Figure 7 illustrates an example probe design process flow according to one 
embodiment of the present invention. 

20 

DETAILED DESCRIPTION OF THE INVENTION 

Abbreviations and Definitions 

Abbreviations for a number of the modified bases described herein are 
25 provided as follows (structures of these bases are shown in Figures 1 A and IB): 6-amino-3- 
prop-1 -ynyl-5-hydropyrazolo[3,4-d]pyrimidine-4-one, PPPG; 6-amino-3-(3-hydroxyprop-l - 
yny)l-5-hydropyrazolo[3,4-d]pyrimidine-4-one, HOPPPG; 6-ammo-3-(3-aminoprop-l-ynyl)- 
5-hydropyrazolo[3,4-d]pyrimidine-4-one, NH 2 PPPG; 4-amino-3~(prop- 1 -ynyl)pyrazolo[3 ,4- 
d]pyrimidine, PPPA; 4-amino-3-(3-hydroxyprop-l-ynyl)pyrazolo[3,4-d]pyrimidine, - 
30 HOPPPA; 4-amino-3-(3-aminoprop4-ynyl)pyrazolo[3,4-d]pyrimidine, NH 2 PPPA; 3 -prop -1- 
ynylpyrazolo[3,4-d]pyrimidine-4,6-diamino, (NH 2 ) 2 PPPA; 2-(4,6-diaminopyrazolo[3,4- 
d]pyrimidin-3-yl)ethyn-l-ol, (NH 2 ) 2 PPPAOH; 3-(2-aminoethynyl)pyrazolo[3,4- 
d]pyrimidine-4,6-diamine, (NH 2 ) 2 PPPANH 2 ; 5-prop-l-ynyl4,3-dihydropyrimidine-2,4- 
dione, PU; 5-(3-hydroxyprop-l-ynyl)-l,3-dihydropyrimidine-2,4-dione, HOPU; 6-amino-5- 
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prop-l-ynyl-3-dihydropyrimidine-2-one, PC; 6-amino-5-(3-hydroxyprop-l-yny)-l,3- 
dihydropyrimidine-2-one, HOPC; and 6-amino-5-(3-aminoprop-l-yny)-l,3- 
dihydropyrimidine-2-one, NH 2 PC; 5-[4-amino-3-(3-methoxyprop-l-ynyl)pyrazol[3,4- 
d]pyrimidinyl]-2-(hydroxymethyl)oxolan-3-ol, CH3OPPPA; 6-amino-l-[4-hydroxy-5- 
5 (hydroxymethyl)oxolan-2-yl]-3<3-m^ 

one, CH3OPPPG; 5^4-hydroxybuM-ynyl)-U^ HOBuU; 6- 

amino-5-(4-hydroxybut-l-ynyl)^ HOBuC; 4-(4-aminopyrazolo[3,4- 

d]pyrimidin-3-yl)but-3-yn-l-ol, HOBuPPA; 6-amino-3-(4-hydroxybut-l-ynyi)pyrazolo[3,4- 
d]pyrimidin-4-ol, HOBuPPG; 4-(4,6^aminopyrazolo[3,4-dft^ 

10 (NH 2 ) 2 BuPPAOH; 3-iodo-lH-pyrazolo[3,4^]pyrimidine^,6-diamine ((NH 2 )2PPAI); 3- 
bromo4H-pyrazolo[3,4-d]pyrimidine-4,6-diamine ((NH 2 ) 2 PPABr); 3-chloro-lH- 
pyrazolo[3,4-d]pyrimidine--4,6-diainine ((NH 2 ) 2 PPAC1); 3-Iodo-lH-pyrazolo[3,4- 
d]pyrimidin-4-ylainine (PPAT); 3-Bromo-m-pyrazolo[3,4<]pyri^^ (PPABr); 
3^Uoro-lH-pyrazolo[3,4-d]pyrimidin-4-ylamine(PPACl). 

15 Unless otherwise stated, the following terms used in the specification and 

claims have the meanings given below: 

The term "alkyl" refers to a linear, branched, or cyclic saturated monovalent 
hydrocarbon radical or a combination of cyclic and linear or branched saturated monovalent 
hydrocarbon radicals having the number of carbon atoms indicated in the prefix. For 

20 example, (Ci-Cg)alkyl is meant to include methyl, ethyl, n-propyl, 2-propyl, tert-butyl, 

pentyl, cyclopentyl, cyclopropylmethyl and the like. For each of the definitions herein (e.g., 
alkyl, alkenyl, alkoxy, aralkyloxy), when a prefix is not included to indicate the number of 
main chain carbon atoms in an alkyl portion, the radical or portion thereof will have eight or 
fewer main chain carbon atoms. 

25 The term "alkylene" means a linear saturated divalent hydrocarbon radical or a 

branched saturated divalent hydrocarbon radical having the number of carbon atoms 
indicated in the prefix. For example, (Ci-C 6 )alkylene is meant to include methylene, 
ethylene, propylene, 2-methylpropylene, pentylene, and the like. 

The term "alkenyl" refers to a linear monovalent hydrocarbon radical or a 

30 branched monovalent hydrocarbon radical having the number of carbon atoms indicated in 
the prefix and containing at least one double bond. For example, (C 2 -C 6 )alkenyl is meant to 
include, ethenyl, propenyl, and the like. 

The term "alkynyl" refers to a linear monovalent hydrocarbon radical or a 
branched monovalent hydrocarbon radical containing at least one triple bond and having the 
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number of carbon atoms indicated in the prefix. For example, (C 2 -C6)alkynyl is meant to 

include ethynyl, propynyl, and the like. 

The terms "alkoxy," "alkylamino" and "alkylthio" (or thioalkoxy) are used in 

their conventional sense, and refer to those alkyl groups attached to the remainder of the 
5 molecule via an oxygen atom, an amino group, or a sulfur atom, respectively. Similarly, the 

term dialkylamino refers to an amino group having two attached alkyl groups that can be the 

same or different. 

The term "aryl" means a monovalent monocyclic or bicyclic aromatic 

hydrocarbon radical of 6 to 10 ring atoms which is unsubstituted or substituted independently 
10 with one to four substituents, preferably one, two, or three substituents selected from alkyl, 

cycloalkyl, cycloalkyl-alkyl, halo, nitro, cyano, hydroxy, alkoxy, amino, acylamino, mono- 

alkylamino, di-alkylamino, haloalkyl, haloalkoxy, heteroalkyl, COR (where R is hydrogen, 

alkyl, cycloalkyl, cycloalkyl-alkyl, phenyl or phenylalkyl), -(CR'R'VCOOR (where n is an 

integer from 0 to 5, R' and R" are independently hydrogen or alkyl, and R is hydrogen, alkyl, 
1 5 cycloalkyl, cycloalkylalkyl, phenyl or phenylalkyl) or -(CR'R'VCONR^ (where n is an 

integer from 0 to 5, R' and R" are independently hydrogen or alkyl, and R a and R^ are, 
independently of each other, hydrogen, alkyl, cycloalkyl, cycloalkylalkyl, phenyl or 
phenylalkyl). More specifically the term aryl includes, but is not limited to, phenyl, biphenyl, 
1-naphthyl, and 2-naphthyl, and the substituted forms thereof. Similarly, the term 
20 "heteroaryl" refers to those aryl groups wherein one or more heteroatoms or heteroatom 
functional groups have replaced a ring carbon, while retaining aromatic properties, e.g., 
pyridyl, quinolinyl, quinazolinyl, thienyl, and the like. For brevity, the term aryl, when used 
in combination with other radicals (e.g., aryloxy, arylalkyl) is meant to include both aryl 
groups and heteroaryl groups as described above. 

25 The term "arylalkyl" refers to a radical -R a R^ where R a is an alkylene group 

(having the indicated number of carbon atoms, or if unspecified having six or fewer main 
chain carbon atoms) and R^ is an aryl group as defined herein. Examples of arylalkyl groups 
include benzyl, phenylethyl, 3-(3-chlorophenyl)-2-methylpentyl, and the like. 

Similarly the term "arylalkenyl" means a radical -R a R b where R a is an 

30 alkenylene group and R b is an aryl group as defined herein, e.g., 3-phenyl-2-propenyl, and 
the like. 
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"Arylheteroalkyl" means a radical -R a Rb where R» is an heteroalkylene group 
(having the indicated number of carbon atoms) and R» is an aryl group as defined herein, 
e.g., 2-hydroxy-2-phenyl-ethyI, 2-hydroxy-l-hydroxymethyl-2-phenyl-ethyl, and the like. 

The term "aryloxy", refers to a radical -OR where R is an aryl group, e.g., 
phenoxy, naphthyloxy and the like. 

The prefix "halo" and the term "halogen" when used to describe a substituent, 
refer to -F, -CI, -Br and -I. 

The term "heteroalkyl" refers to an alkyl radical as defined herein with one, 
two or three substituents independently selected from cyano, -OR a , -NR'R 0 , and -S(0) n R d 
(where n is an integer from 0 to 2 ), with the understanding that the point of attachment of the 
heteroalkyl radical is through a carbon atom of the heteroalkyl radical. R a is hydrogen, alkyl, 
aryl, arylalkyl, alkoxycarbonyl, aryloxycaibonyl, carboxamido, or mono- or di- 
alkylcarbamoyl. R b is hydrogen, alkyl, aryl or arylalkyl. R c is hydrogen, alkyl, aryl, 
arylalkyl, alkoxycarbonyl, aryloxycarbonyl, carboxamido, mono- or di-alkylcarbamoyl or 
alkylsulfonyl. R d is hydrogen (provided that n is 0), alkyl, aryl, arylalkyl, amino, mono- 
alkylamino, di-alkylamino, or hydroxyalkyl. Representative examples include, for example, 
2-hydroxyethyl, 2,3-dihydroxypropyl, 2-methoxyethyL benzyloxymethyl, 2-cyanoethyl, and 
2-methylsulfonyl-ethyl. For each of the above, R a , R b ,R C , and R d can be further substituted 
by NH 2 , fluorine, alkylamino, di-alkylamino, OH or alkoxy. Additionally, the prefix 
indicating the number of carbon atoms (e.g., C,-C, 0 ) refers to the total number of carbon 
atoms in the portion of the heteroalkyl group exclusive of the cyano, -OR", -NR^, or 
-S(0) n R d portions. 

The term "heterocyclyl" refers to a saturated or unsaturated non-aromatic 
cyclic radical of 3 to 8 ring atoms in which one or two ring atoms are heteroatoms selected 
from O, NR (where R is independently hydrogen or alkyl) or S(0) n (where n is an integer 
from 0 to 2), the remaining ring atoms being C, where one or two C atoms may optionally be 
replaced by a carbonyl group. The heterocyclyl ring may be optionally substituted 
independently with one, two, or three substituents selected from alkyl, halo, nitro, cyano, 
hydroxy, alkoxy, amino, mono-alkylamino, di-alkylamino, haloalkyl, haloalkoxy, -COR 
(where R is hydrogen, alkyl, cycloalkyl, cycloalkyl-alkyl, phenyl or phenylalkyl), -(CR'R") n - 
COOR (n is an integer from 0 to 5, R' and R" are independently hydrogen or alkyl, and R is 
hydrogen, alkyl, cycloalkyl, cycloalkyl-alkyl, phenyl or phenylalkyl), or^CR'R'V 
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CONRa R b (where nism integer from 0 tQ 5j R , md R „ ^ independently hydrogen or alkyl, 
and Ra and Rb are, independently of each other, hydrogen, alkyl, phenyl or phenylalkyl). 
More specifically the term heterocyclyl includes, but is not limited to, tetrahydropyranyl, 
piperidino, N-methylpiperidin-3-yl, piperazino, N-methylpyrrolidin-3-yl, 3-pyrrolidino, 2- 
pyrroUdon-l-yl, morpholino, thiomorpholino, thiomorpholino-l-oxide, thiomorpholino-1,1- 
dioxide, pyrrolidinyl, and the derivatives thereof. The prefix indicating the number of carbon 
atoms (e.g., C3-C10) refers to the total number of carbon atoms in the portion of the 
heterocyclyl group exclusive of the number of heteroatoms. 

The terms "heterocyclylalkyl," "heterocyclylalkenyl," "heterocyclylalkynyl" 
refer to radicals -R a Rb where Ra is an alkylene, alkenylene or alkynylene group, 
respectively, and R*> is a heterocyclyl group as defined herein, e.g., tetrahydropyran-2- 
ylmethyl, 4-methylpiperazin-l-ylethyl, 3-piperidinylmethyl, and the like. 

The terms "heteroalkylene" means a linear saturated divalent hydrocarbon 
radical of one to six carbons or a branched saturated hydrocarbon radical of three to six 
carbon atoms with one, two or three substituents independently selected from -OR", -NR b R c , 
and -S(0)nR d (where n is an integer from 0 to 2 ) where, R a , Rb, Rc, and R d are as defined 
herein for a heteroalkyl radical. Examples include, 2-hydroxyethan-l,2-diyl, 2- 
hydroxypropan-l,3-diyl and the like. 

Each of the above terms (e.g., "alkyl," "heteroalkyl," and "aryl") are meant to 
include both substituted and unsubstituted forms of the indicated radical. Preferred 
substituents for each type of radical are provided below. 

Substituents for the alkyl and heteroalkyl radicals (including those groups 
often referred to as alkylene, alkenyl, heteroalkylene, heteroalkenyl, alkynyl, 
heterocycloalkyl, and heterocycloalkenyl) can be a variety of groups selected from: -OR', 
=0, =NR\ =N-OR', -NR'R", -SR\ -halogen, -SiR'R"R"\ -OC(0)R', -C(0)R', -C0 2 R', - 
CONR'R", -OC(0)NR'R", -NR"C(0)R', -NR'-C(0)NR"R"', -NR"C(0) 2 R', -NH- 
C(NH 2 )=NH, -NR'C(NH 2 )=NH, -NH-C(NH 2 )=NR', -S(0)R', -S(0) 2 R', -S(0) 2 NR'R", -CN 
and -N0 2 in a number ranging from zero to four, preferably, zero, one, two or three 
substituents. R', R" and R"' each independently refer to hydrogen, unsubstituted (d- 
C 8 )alkyl and heteroalkyl, unsubstituted aryl, aryl substituted with 1-3 halogens, unsubstituted 
alkyl, alkoxy or thioalkoxy groups, or aryl-(Ci-C 4 )alkyl groups. When R' and R" are 
attached to the same nitrogen atom, they can be combined with the nitrogen atom to form a 5- 
, 6-, or 7-membered ring. For example, -NR'R" is meant to include 1-pyrrolidinyl and 4- 
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morpholinyl. From the above discussion of substituents, one of skill in the art will 
understand that the term "alkyl" in its broadest sense is meant to include groups such as 
haloalkyl (e.g., -CF 3 and -CH 2 CF 3 ) and acyl {e.g., -C(0)CH 3 , -C(0)CF 3 , -C(0)CH 2 OCH 3 , 
and the like). Preferably, the alkyl groups will have from 0-3 substituents, more preferably 0, 
1, or 2 substituents, unless otherwise specified. 

Similarly, substituents for the aryl groups are varied and are selected from: - 
halogen, -OR', -OC(0)R\ -NR'R", -SR\ -R', -CN, -N0 2 , -CO z R', -CONR'R", -C(0)R', - 
OC(0)NR'R", -NR"C(0)R', -NR"C(0) 2 R', ,-NR'-C(0)NR"R"', -NH-C(NH 2 )=NH, - 
NR'C(NH 2 )=NH, -NH-C(NH 2 )=NR\ -S(0)R', -S(0) 2 R', -S(0) 2 NR'R", -N 3 , -CH(Ph) 2 , 
perfluoro(C,-C 4 )alkoxy, and perfluoro(Ci-C 4 )alkyl, in a number ranging from zero to the 
total number of open valences on the aromatic ring system; and where R\ R" and R"' are 
independently selected from hydrogen, (C,-C 8 )alkyl and heteioalkyl, unsubstituted aryl and 
heteroaryl, (unsubstituted aryl)-(Ci-C 4 )alkyl, and (unsubstituted aryl)oxy-(d-C4)alkyl. 

Two of the substituents on adjacent atoms of the aryl or heteroaryl ring may 
optionally be replaced with a substituent of the formula -T-C(0)-(CH 2 ) q -U-, wherein T and U 
are independently -NH-, -O-, -CH 2 - or a single bond, and q is an integer of from 0 to 2. 
Alternatively, two of the substituents on adjacent atoms of the aryl or heteroaryl ring may 
optionally be replaced with a substituent of the formula -A-(CH 2 )r-B-, wherein A and B are 
independently -CH 2 -, -0-, -NH-, -S-, -S(O)-, -S(0) 2 -, -S(0) 2 NR'- or a single bond, and r is an 
integer of from 1 to 3. One of the single bonds of the new ring so formed may optionally be 
replaced with a double bond. Alternatively, two of the substituents on adjacent atoms of the 
aryl or heteroaryl ring may optionally be replaced with a substituent of the formula -(CH 2 ) S - 
X-(CH 2 ) r , where s and t are independently integers of from 0 to 3, and X is -0-, -NR'-, -S-, - 
S(O)-, -S(0) 2 -, or -S(0) 2 NR'-. The substituent R' in -NR'- and -S(0) 2 NR'- is selected from 
hydrogen or unsubstituted (Ci-C 6 )alkyl. 

Certain compounds or oligonucleotides of the present invention may exist in a 
salt form. Such salts include base addition salts such as sodium, potassium, calcium, 
ammonium, organic amino, or magnesium salt, or a similar salt. When the compounds or 
modified oligonucleotides of the present invention contain relatively basic functionalities, 
acid addition salts can be obtained by contacting the neutral form of such compounds with a 
sufficient amount of the desired acid, either neat or in a suitable inert solvent. Examples of 
acceptable acid addition salts include those derived from inorganic acids like hydrochloric, 
hydrobromic, nitric, carbonic, monohydrogencarbonic, phosphoric, 
monohydrogenphosphoric, dihydrogenphosphoric, sulfuric, monohydrogensulfuric, 
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hydriodic, or phosphorous acids and the like, as well as the salts derived from organic acids 
like acetic, propionic, isobutyric, maleic, malonic, lactic, benzoic, succinic, suberic, fumaric, 
mandelic, phthalic, benzenesulfonic, p-tolylsulfonic, citric, tartaric, methanesulfonic, and the 
like. Also included are salts of amino acids such as arginate and the like, and salts of organic 
5 acids like glucuronic or galactunoric acids and the like (see, for example, Berge, S.M., et al, 
"Pharmaceutical Salts", Journal of Pharmaceutical Science, 1977, 66, 1-19). Certain specific 
compounds of the present invention contain both basic and acidic functionalities that allow 
the compounds to be converted into either base or acid addition salts. 

The neutral forms of the compounds may be regenerated by contacting the salt 
10 with a base or acid and isolating the parent compound in the conventional manner. The 
parent form of the compound differs from the various salt forms in certain physical 
properties, such as solubility in polar solvents, but otherwise the salts are equivalent to the 
parent form of the compound for the purposes of the present invention. 

Certain compounds of the present invention can exist in unsolvated forms as 
15 well as solvated forms, including hydrated forms. In general, the solvated forms are 

equivalent to unsolvated forms and are intended to be encompassed within the scope of the 
present invention. Certain compounds of the present invention may exist in multiple 
crystalline or amorphous forms. In general, all physical forms are equivalent for the uses 
contemplated by the present invention and are intended to be within the scope of the present 
20 invention. 

Certain compounds of the present invention possess asymmetric carbon atoms 
(optical centers) or double bonds; the racemates, diastereomers, geometric isomers and 
individual isomers are all intended to be encompassed within the scope of the present 
invention. The methods for the determination of stereochemistry and the separation of 
isomers are well-known in the art (see discussion in Chapter 4 of "Advanced Organic 
Chemistry", 4th edition J. March, John Wiley and Sons, New York, 1992). 

The compounds of the present invention may also contain unnatural 
proportions of atomic isotopes at one or more of the atoms that constitute such compounds. 
For example, the compounds may be radiolabeled with radioactive isotopes, such as for 
30 example tritium ( 3 H), iodine-125 ( ,25 I) or carbon-14 ( ,4 C). All isotopic variations of the 
compounds of the present invention, whether radioactive or not (e.g, 2 H), are intended to be 
encompassed within the scope of the present invention. 

"Protecting group" or "protected form thereof refers to a grouping of atoms 
that when attached to a reactive group in a molecule masks, reduces or prevents that 

10 
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reactivity. Examples of protecting groups can be found in T.W. Greene and P.G. Futs, 
Protective Groups in Organic Chemistry, (Wiley, 2nd ed. 1991), Beaucage and Iyer, 
Tetrahedron 48:2223-23 1 1 (1992), and Harrison and Harrison et al, Compendium of 
Synthetic Organic Methods, Vols. 1-8 (John Wiley and Sons. 1971-1996). Representative 
amino protecting groups include formyl, acetyl, trifluoroacetyl, benzyl, benzyloxycarbonyl 
(CBZ), tert-butoxycarbonyl (Boc), trimethyl silyl (TMS), 2-trimethylsilyl-ethanesulfonyl 
(SES), trityl and substituted trityl groups, allyloxycarbonyl, 9-fluorenylmethyloxycarbonyl 
(FMOC), nitro-veratryloxycarbonyl (NVOC) and the like. Representative hydroxy protecting 
groups include those where the hydroxy group is either acylated or alkylated such as benzyl 
and trityl ethers as well as alkyl ethers, tetrahydropyranyl ethers, trialkylsilyl ethers and allyl 
ethers. 

"Optional" or "optionally" in the above definitions means that the 
subsequently described event or circumstance may but need not occur, and that the 
description includes instances where the event or circumstance occurs and instances in which 
it does not. For example, "heterocyclo group optionally mono- or di- substituted with an 
alkyl group " means that the alkyl may but need not be present, and the description includes 
situations where the heterocyclo group is mono- or disubstituted with an alkyl group and 
situations where the heterocyclo group is not substituted with the alkyl group. 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques in organic chemistry, biochemistry, oligonucleotide synthesis and 
modification, bioconjugate chemistry, nucleic acid hybridization, molecular biology, 
microbiology, genetics, recombinant DNA, and related fields as are within the skill of the art. 
These techniques are folly explained in the literature. See, for example, Maniatis, Fritsch & 
Sambrook, MOLECULAR CLONING: A LABORATORY MANUAL, Cold Spring Harbor 
Laboratory Press (1982); Sambrook, Fritsch & Maniatis, MOLECULAR CLONING: A 
LABORATORY MANUAL, Second Edition, Cold Spring Harbor Laboratory Press (1989); 
Ausubel, et al, CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & 
Sons (1987, 1988, 1989, 1990, 1991, 1992, 1993, 1994, 1995, 1996); Gait(ed.), 
OLIGONUCLEOTIDE SYNTHESIS: A PRACTICAL APPROACH, IRL Press (1984); 
Eckstein (ed.), OLIGONUCLEOTIDES AND ANALOGUES: A PRACTICAL 
APPROACH, IRL Press (1991). 
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General 

Oligonucleotides are short polymers of nucleotides, generally less than 200 
nucleotides, preferably less than 150 nucleotides, more preferably less than 100 nucleotides, 
more preferably less than 50 nucleotides and most preferably less than 21 nucleotides in 
5 length. Polynucleotides are generally considered, in the art, to comprise longer polymers of 
nucleotides than do oligonucleotides, although there is an art- recognized overlap between the 
upper limit of oligonucleotide length and the lower limit of polynucleotide length. With 
respect to the present invention, "oligonucleotide" generally refers to a nucleic acid, usually 
comprising a detectable label, that is used as a probe or as a primer; while polynucleotide 

10 refers to a nucleic acid containing a target sequence. Consequently, for the purposes of the 
present invention, the terms "oligonucleotide" and "polynucleotide" shall not be considered 
limiting with respect to polymer length. 

The present invention provides modified oligonucleotides having new and 
surprising properties of superior mismatch discrimination, compared to unmodified 

15 oligonucleotides. Modified oligonucleotides of the invention are used as probes, wherein 
their hybridization to a target sequence is detected, or as primers, wherein their hybridization 
to a target sequence is followed by polynucleotide synthesis initiated from the 3' terminus of 
the modified oligonucleotide, and the synthesized product (i.e., the extension product) is 
detected. 

20 A tar S et sequence refers to a nucleotide sequence which comprises a site of 

hybridization for a probe or a primer. Target sequences can be found in any nucleic acid 
including, but not limited to, genomic DNA, cDNA, RNA and any amplified product thereof, 
and can comprise a wild-type gene sequence, a mutant gene sequence, a non-coding 
sequence, a regulatory sequence, etc. A target sequence will generally be less than 100 

25 nucleotides, preferably less than 50 nucleotides, and most preferably, less than 21 nucleotides 
in length. 

Description of the Embodiments 



30 



The present invention provides a number of modified oligonucleotides which 
can generally be divided into three groups. 

The first group of modified oligonucleotides are those having at least two 
modified bases replacing the naturally-occurring bases. Here, the modified bases will be 
unsubstituted or 3-substituted pyrazolo[3,4-d]pyrimidines. In some embodiments, however, 
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the modified bases will be selected such that at least one of the bases is a 5-substituted 
pyrimidine and at least one of the bases is an unsubstituted or a 3-substituted pyrazolo[3,4- 
d]pyrimidine. Preferably, this group of modified oligonucleotides will have additional 
attached groups (e.g., minor groove binders, reporter groups, quenchers, etc.) which assist 
during assays to detect target sequences. 

The second group of modified oligonucleotides are those having at least one 
modified base, but also having an attached minor groove binder, reporter group, quencher, c 
the like. 

The third group of modified oligonucleotides are those that comprise one or 
more of the novel modified bases described below. As in the first group, this group of 
modified oligonucleotides will preferably have attached groups selected from, for example, 
minor groove binders, reporter groups or quenchers. 



Modified Oligonucleotides 



In one aspect, the present invention provides modified oligonucleotides 
comprising at least two bases selected from the group consisting of unsubstituted and 3- 
substituted pyrazolo[3,4-d]pyrimidine bases. 

The pyrazolo[3,4-d]pyrimidine bases are those bases in which a pyrazole ring 
is fused to a pyrimidine ring in the following specific orientation: 




vwv 



la 

in which the wavy line indicates the point of attachment between the base and either an 
attached ougomer-forming sugar or an amino acid involved in peptide nucleic acid formation. 
Additionally, the groups X 1 and X 2 are independently H, OH or NH 2 such that the 
pyra2»lo[3,4-d]pyrimidine bases approximate the construction of the natural purine bases, 
guanosine, adenine and inosine, as well as related derivatives of those bases. In this group of 
embodiments, an "unsubstituted pyrazolo[3,4-d]pyrimidine base" refers to those bases of the 
general formula la, while the term "3-substituted pyrazolo[3,4-d]pyrimidine base" refers to 
those bases having the formula lb, in which R l is defined as described below. 
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X 2 R 1 




fir 



N 



lb 

In formula lb, the symbols X 1 and X 2 independently represent H, OH, NH 2 or 
a protected form thereof. The symbol R 1 represents a member selected from (Q- 
5 Ci 2 )heteroalkyl, (C 2 -C 12 )heteroalkenyl, (C 2 -Ci 2 )heteroalkynyl, -0-(Ci-C 12 )alkyl, -0-(C 2 - 
Ci 2 )alkenyl, -0-(C 2 -C 12 )alkynyl, -S-(C,-Ci 2 )alkyl, -S-(C 2 -C 12 )alkenyl, -S-(C 2 -C, 2 )alkynyl, 
heterocyclyl(Ci-Ci 2 )alkyl, heterocyclyl(C 2 -Ci 2 )alkenyl, heterocyclyl(C 2 -Ci 2 )alkynyl, aryl(Ci- 
Ci 2 )alkyl, aryl(C 2 -C 12 )alkenyl, aryl(C 2 -C 12 )alkynyl, aryl, heterocyclyl, halogen, -CN, 
-CONH 2 and protected forms thereof. Preferred heteroalkyl, heteroalkenyl and heteroalkynyl 

1 0 groups are those that terminate (distal to the pyrazolo[3,4^]pyrimidine ring system) in a 
heteroatom group (e.g., OH, NH 2 , SH and the like). For example, preferred heteroalkyl, 
heteroalkenyl and heteroalkynyl groups include 3-amino-l-propyl, 4-hydroxy-l-butyl, 3- 
amino-l-propyn-l-yl, 3-hydroxy-l-propyn-l-yl, 4-hydroxy-3-hydroxymethyl-l-butyn-l-yl, 
4-hydroxy-l-butyn-l-yl, and their higher homologs. Other preferred R 1 groups include those 

15 which terminate in an aryl or heterocyclic group (e.g., heterocyclyl(C r Ci 2 )alkyl, 

heterocyclyl(C 2 -C 12 )alkenyl, heterocyclyl(C 2 -Ci 2 )alkynyl, aryl(C,-Ci 2 )alkyl, aryl(C 2 - 
Ci 2 )alkenyl and aryl(C 2 -Ci 2 )alkynyl). Preferred aryl and heterocyclic groups are either 
substituted or unsubstituted phenyl, thienyl, thiazolyl, imidazolyl, furanyl, oxazolyl, 
pyridinyl, pyrrolyl, indolyl, benzimidazolyl, benzthiazolyl and benzoxazolyl. 

20 m particularly preferred embodiments, the pyrazolo[3,4-d]pyrimidine base is 

selected from: 



R 1 NH 2 R 1 NH 2 R 1 



H 2 N' 





Ic Id Ie 

Still more preferably, the modified bases are selected from Ic, Id or Ie wherein 
25 each R l is a propynyl, hydroxypropynyl, aminopropynyl, aminobutynyl, hydroxybutynyl, or a 
substituted or unsubstituted phenyl, thienyl, thiazolyl, imidazolyl, furanyl, oxazolyl, 
pyridinyl, pyrrolyl, indolyl, benzimidazolyl, benzthiazolyl or benzoxazolyl group attached 
directly to the pyrazolo[3,4-d]pyrimidine ring system or attached to the ring system via a one 
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to four carbon linking group which can be saturated (e.g., ethylene, propylene, butylene) or 
unsaturated (e.g., acetylenyl, propynylene, butynylene, propenylene, butenylene). 

The modified oligonucleotides of the present invention have a backbone such 
as, for example, that found with the naturaUy-occuiring oligonucleotides or peptide-nucleic 
acids (e.g., heterocyclic bases attached to oligomer-forming sugars or peptide-nucleic acid- 
forming amino acids). Alternative oligomeric backbones are known to those of skill in the art 
(referred to below as "modified" backbones or comprising backbone modifications). For the 
purpose of this invention a modified oligonucleotide is any oligomeric polymer, including the 
natural phosphate backbone and non-natural backbones, that contains natural and/or modified 
bases and retains the ability to hybridize specifically to a complementary nucleic acid to form 
a stable duplex. 

In preferred embodiments, the modified oligonucleotides of the invention have 
a backbone of sugar or glycosidic moieties, preferably 2-deoxyribofuranosides wherein all 
intemucleotide linkages are the naturally occurring phosphodiester linkages. In alternative 
embodiments however, the 2-deoxy-p-D-ribofuranose groups are replaced with other sugars, 
for example, p-D-ribofuranose. In addition, P-D-ribofuranose may be present wherein the 2- 
OH of the ribose moiety is alkylated with a C,^ alkyl group (2-(0-C,^ alkyl) ribose) or with 
a C 2 -6 alkenyl group (2-(0-C 2 . 6 alkenyl) ribose), or is replaced by a fluoro group (2- 
fluororibose). Related ohgomer-foiming sugars useful in the present invention are those that 
are "locked", i.e., contain a methylene bridge between C-4' and an oxygen atom at C-2'. 
Other sugar moieties compatible with hybridization of the oligonucleotide can also be used, 
and are known to those of skill in the art, including, but not limited to, a-D- 
arabinofuranosides, a-2'-deoxyribofuranosides or ^3^ideoxy-3'-ammoribofuranosides. 
Oligonucleotides containing cu-D-arabinofuranosides can be prepared as described in U.S. 
Patent No. 5,177,196. Oligonucleotides containing 2^3'-dideoxy-3'-aniinoribofuranosides are 
described in Chen et al. ((1995) Nucleic Acids Res. 23:2661-2668). Synthetic procedures for 
locked nucleic acids (Singh et al, Chem. Comm., 455-456 (1998); Wengel J., Acc. Chem. 
Res., 32:301-310 (1998)) and oligonucleotides containing 2'-halogen-2'- 
deoxyribomranosides (Palissa et al., Z. Chem.. 27:216 (1987)) have been described.. The 
phosphate backbone of the modified oligonucleotides described herein can also be modified 
so that the oligonucleotides contain phosphorothioate linkages and/or methylphosphonates 
and/or phosphoroamidates (Chen et al., Nucl. Acids Res., 23:2662-2668 (1995)). 
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Combinations of oligonucleotide linkages are also within the scope of the present invention. 
Still other backbone modifications are known to those of skill in the art. 

In another group of embodiments, the modified bases described herein are 
incorporated into PNA and DNA/PNA chimeras to balance T m s and provide modified 
oligonucleotides having improved mismatch discrimination. Various modified forms of 
DNA and DNA analogues have been used in attempts to overcome some of the disadvantages 
of the use of DNA molecules as probes and primers. Among these are peptide nucleic acids 
(PNAs, also known as polyamide nucleic acids). Nielsen et al. (1991) Science 254:1497- 
1500. PNAs contain heterocyclic base units, as found in DNA and RNA, that are linked by a 
polyamide backbone, instead of the sugar-phosphate backbone characteristic of DNA and 
RNA. PNAs are capable of hybridization to complementary DNA and RNA target sequences 
and, in fact, hybridize more strongly than a corresponding nucleic acid probe. The synthesis 
of PNA oligomers and reactive monomers used in the synthesis of PNA oligomers have been 
described in U.S. Patents No. 5,539,082; 5,714,331; 5,773,571; 5,736,336 and 5,766,855. 
Alternate approaches to PNA and DNA/PNA chimera synthesis and monomers for PNA 
synthesis have been summarized. Uhlmann et al. (1998) Angew. Chem, Int. Ed. 37:2796- 
2823. Accordingly, the use of any combination of normal bases, unsubstituted pyrazolo[3,4- 
d]pyrimidine bases (e.g., PPG and PPA), 3-substituted pyrazolo[3,4-d]pyrimidines, modified 
purine, modified pyrimidine, 5-substituted pyrimidines, universal bases or a minor groove 
binder to balance the T m of a PNA or DNA/PNA chimera is in the scope of this invention. A 
variety of universal bases are known in the art. Other universal bases have recently been 
described and are alos useful in the present invention (see, Seela, et al., XIV International 
Round Table: Nucleosides, Nucleotides and Their Biological Applications, September 10-14, 
2000, San Francisco California, pp. 40). The synthetic methods necessary for the synthesis of 
modified base monomeric units required for PNA and PNA/DNA chimeras synthesis are 
available in the art, see methods in this application and Uhlmann et al. Angew. Chem. Int. Ed. 
37:2796-2823 (1998). 

Similarly, the present invention demonstrates a combination of normal bases, 
unsubstituted pyrazolo[3,4-d]pyrimidine bases (e.g., PPG and PPA), 3-substituted 
pyrazolo[3,4-d]pyrimidines, modified purine, modified pyrimidine, 5-substituted 
pyrimidines, universal bases or a minor groove binder can be used to balance the T m of any 
oligonucleotide polymer or oligopolymer/DNA chimera. 

For the uses described herein, the modified oligonucleotides will preferably 
have from 4 to 70 bases, more preferably from 4 to 25 bases. In some embodiments, the 
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modified oligonucleotides will have 15 or fewer, or more preferably 10 or fewer bases. 
Additionally, the modified oligonucleotides will, in some embodiments, have 3, 4, 5, 6, 7, 8, 
or 9 or more modified bases (either unsubstituted or 3-substituted pyrazolo[3,4- 
djpvrimidines). For each of the embodiments wherein multiple 3-substituted pyrazolo[3,4- 
djpyrimidine bases are present, the modified bases can either be the same or different from 
each other. 

In addition to the modified base components, the modified oligonucleotides of 
the present invention will in some embodiments comprise additional pendant groups such as, 
for example, intercalators, lipophilic groups, minor groove binders, reporter groups, chelating 
agents, quenchers and cross-linking agents attached to one or more of the internally located 
nucleotide bases, to the 3', to the 5' end, to both ends, or can have such pendant groups 
attached both internally and at one or both ends. Methods suitable for attaching intercalator, 
lipophilic groups, minor grove binders, reporter groups, chelating agents, quenchers and 
cross-linking agents to oligonucleotides have been described in, for example, U.S. Patent 
Nos. 5,512,667 and 5,419,966, PCT publication WO 96/32496 and U.S. application Ser. No. 
09/457,616. The oligonucleotides of the invention can also have a relatively low molecular 
weight "tail moiety" attached either at the 3' or 5' end, or at both ends. By way of example a 
tail molecule can be a phosphate, a phosphate ester, an alkyl group, an aminoalkyl group, or a 
lipophilic group. The tail moiety can also link the intercalators, lipophilic groups, minor 
groove binders, reporter groups, chelating agents and cross-linking functionalities to the 
oligonucleotides of the invention. The nature of tail moieties and methods for obtaining 
oligonucleotides with various tail moieties are also described in the above-referenced U.S. 
Patent Nos. 5,512,667 and 5,419,966. 

Minor Groove Binders 

In one group of embodiments, the modified oligonucleotide will also have a 
covalently attached minor groove binder (MGB). A variety of suitable minor groove binders 
have been described in the literature. See, for example, Kutyavin, et al. U.S. Patent No. 
5,801,155; Wemmer, D.E., and Dervan P.B., Current Opinon in Structural Biology, 7:355- 
361 (1997); Walker, W.L., Kopka, J.L. and Goodsell, D.S., Biopolymers, 44:323-334 (1997); 
Zimmer, C & Wahnert, U. Prog. Biophys. Molec. Bio. 47:31-112 (1986) andReddy, B.S.P., 
Dondhi, S.M., and Lown, J. W., Pharmacol. Therap., 84:1-11 1 (1999). 

Suitable methods for attaching MGBs (as well as reporter groups such as 
fluorophores and quenchers described below) through linkers to oligonucleotides are 
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described in, for example, U.S. Patent Nos. 5,512,677; 5,419,966; 5,696,251; 5,585,481; 
5,942,610 and 5,736,626. 

The MGB can be attached at either or both ends of the oligonucleotide. In 
addition or alternatively, one or more MGBs can be attached in the interior of the 
oligonucleotide, depending on the length of the oligonucleotide. In general, conjugation of a 
MGB to either end of an oligonucleotide would provide the greatest degree of hybrid 
stability, since melting of an oligonucleotide duplex begins at the temiini. Nonetheless, if 
both ends of a duplex formed by an oligonucleotide are relatively stable, for example, due to 
a high G+C content, attachment of a MGB in the interior of an oligonucleotide (for instance, 
near an A+T-rich sequence) could also enhance stability. The intended use of the 
MGB-oligonucleotide conjugate may also place limitations on the location of the conjugated 
MGB. For example, if an oligonucleotide is designed to be used as a primer, the 3'-hydroxy 
group must be free and capable of being elongated by a polymerizing enzyme. Alternatively, 
an assay that requires an oligonucleotide possessing a labeled 5'-end would require internal 
or 3'-end attachment of a MGB. 

The location of a MGB within a MGB-modified oligonucleotide conjugate can 
also affect the discriminatory properties of such a conjugate. An unpaired region within a 
duplex will result in changes in the shape of the minor groove in the vicinity of the mispaired 
base(s). Since MGBs fit best within the minor groove of a perfectly-matched DNA duplex, 
mismatches resulting in shape changes in the minor groove would reduce binding strength of 
a MGB to a region containing a mismatch. Hence, the ability of a MGB to stabilize such a 
hybrid would be decreased, thereby increasing the ability of a MGB-oligonucleotide 
conjugate to discriminate a mismatch from a perfectly-matched duplex. On the other hand, if 
a mismatch lies outside of the region complementary to a MGB-oligonucleotide conjugate, 
discrirninatory ability for unconjugated and MGB-conjugated oligonucleotides of equal 
length is expected to be approximately the same. Since the ability of an oligonucleotide 
probe to discriminate single base pair mismatches depends on its length, shorter 
oligonucleotides are more effective in discriminating mismatches. The primary advantage of 
the use of MGB-oligonucleotides conjugates in this context lies in the fact that much shorter 
oligonucleotides compared to those used in the prior art (i.e., 20-mers or shorter), having 
greater discriminatory powers, can be used, due to the pronounced stabilizing effect of MGB 
conjugation. 

Preferred minor groove binders are those selected from the formulae: 
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25 






and 



the subscript m is an integer of from 2 to 5; the subscript r is an integer of from 2 to 10; and 
each R a and R b is independently a linking group to the modified oligonucleotide, H, -OR 0 , 
-NR c R d , -COOR c or -CONR°R d , wherein each R c and R d is selected from H, (C,- 
5 Ci 2 )heteroalkyl, (C 2 -Ci2)heteroalkenyl, (C 2 -Ci 2 )heteroalkynyl, (C,-C 12 )alkyl, (C 2 - 
C 12 )alkenyl, (C 2 -Ci 2 )alkynyl, aryl(Ci-Ci 2 )alkyl and aryl. 

Particularly preferred minor groove binders include the trimer of 3-carbamoyl- 
U-dmydro-(3i^)-pyrrolo[3,2-e]indole-7-carboxylate (CDPI 3 ), the pentamer of N- 
methylpyrrole-4-carbox-2-amide (MPC 5 ) and other minor groove binders that exhibit 
10 increased mismatch discrimination. Additional MGB moieties that will find use in the 
practice of the present invention are disclosed in co-owned U.S. Patent No. 5,801,155. In 
certain embodiments, the MGBs can have attached water solubility-enhancing groups (e.g., 
sugars or amino acids). 

15 Reporter Groups 

In another group of embodiments, the modified oligonucleotide will further 
comprise at least one covalently attached reporter group. The reporter groups can be attached 
using methods and linking groups described above for the MGBs. Suitable reporter groups 
for the present modified oligonucleotides include beads, nanoparticles (Taton, T.A. et al, 

20 Science 289: 1 757-1 760 (2000)), chemiluminescers, isotopes, enzymes and fluorophores. 
Preferably, the reporter group is a fluorophore (see, Haugland, R.P., Handbook OF 
Fluorescent Probes and Research Chemicals, Sixth Edition, Molecular Probes, Eugene, 
OR, 1996). Suitable fluorophores include the resorufin dyes, coumarin dyes, rhodamine 
dyes, cyanine dyes, BODIPY dyes and pyrenes. 



Quenchers 



Recently developed detection methods employ the process of fluorescence 
resonance energy transfer (FRET) for the detection of probe hybridization rather than direct 
detection of fluorescence intensity. In this type of assay, FRET occurs between a donor 
fluorophore (reporter) and an acceptor molecule (quencher) when the absorption spectrum of 
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the quencher molecule overlaps with the emission spectrum of the donor fluorophore and the 
two molecules are in close proximity. The excited-state energy of the donor fluorophore is 
transferred to the neighboring acceptor by a resonance dipole-induced dipole interaction, 
which results in quenching of the donor fluorescence. If the acceptor molecule is a 
5 fluorophore, its fluorescence may sometimes be increased. The efficiency of the energy 
transfer between the donor and acceptor molecules is highly dependent on distance between 
the molecules. Equations describing this relationship are known. The Forster distance (Ro) is 
described as the distance between the donor and acceptor molecules where the energy 
transfer is 50% efficient. Other mechanisms of fluorescence quenching are also known, such 

10 as, collisional and charge transfer quenching. There is extensive guidance in the art for 

selecting quencher and fluor pairs and their attachment to oligonucleotides (Haugland, R.P., 
Handbook of Fluorescent Probes and Research Chemicals, Sixth Edition, Molecular 
Probes, Eugene, OR, 1996; U.S. Patent Nos. 3,996,345 and 4,35 1,760 and the like). 

Preferred quenchers are described in co-owned U.S. Ser. No. 09/457,616 (filed 

15 December 8, 1999). 

Fluorophores and Quenchers 

In certain embodiments of the present invention, oligonucleotides comprising 
20 fluorescent labels (fluorophores) and/or fluorescence quenching agents are used. In a 

preferred embodiment, an oligonucleotide contains both a fluorophore and a quenching agent. 
Fluorescent labels include, but are not limited to, fluoresceins, rhodamines, cyanines, 
phycoerythrins, and other fluorophores as described herein. Still other suitable fluorophores 
are known to those of skill in the art. As noted above, quenching agents or quenchers can 
25 absorb energy emitted by a fluorophore so as to reduce the amount of fluorescence emitted 
(i.e., quench the emission of the fluorescent label). Different fluorophores are quenched by 
different quenching agents. In general, the spectral properties of a particular 
fluorophore/quenching agent pair are such that one or more absorption wavelengths of the 
quencher overlaps one or more of the emission wavelengths of the fluorophore, A preferred 
30 fluorophore/quencher pair can be selected by one of skill in the art by comparison of 
emission and excitation wavelengths according to the properties set forth above. 

For use in amplification assays conducted at elevated temperatures, such as a 
polymerase chain reaction, or other procedures utilizing thermostable enzymes, the label is 
preferably one that is stable at elevated temperatures. For assays involving polymerization, 
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the label is one that does not interfere with the activity of the polymerizing enzyme. 
Additionally, the label will be present at the 5' and/or 3' end of the oligonucleotide, and/or 
can also be present internally in a non-interfering position. Accordingly, the label can be 
attached to any of the bases, sugars or phosphate moieties of the oligonucleotide, or to any 
linking group that is itself attached to one of these moieties. 

While the invention has been described in terms of the various bases and other 
optional components, the overall structure of the modified oligonucleotides can also be 
expressed in a formula to indicate a desired construction. 

Thus, in one group of embodiments, the modified oligonucleotides have the 

foimula: 

? 

R 2 -f-M— l-R 3 
L J n 

wherein R 2 and R 3 represent the termini of the modified oligonucleotide; the subscript n is an 
integer of from 4 to 70, more preferably 4 to 25 and still more preferably 4 to 10; each B is a 
member independently selected from adenine, thymine, cytosine, guanine, uracil, an 
unsubstituted pyrazolo[3,4-d]pyrimidine and a 3-substituted pyraolo[3,4-d]pyrimidine; and 
each M is selected from an oligomer-forming sugar and a peptide-nucleic acid-forming amino 
acid, with the proviso that at least two of the Bs are selected from unsubstituted pyrazolo[3,4- 
d]pyrimidines and a 3-substituted pyrazolo[3,4-d]pyrimidines. In certain specific 
embodiments, R 2 and R 3 represent the 5'- and 3'-ends of a modified oligonucleotides wherein 
M is an oligomer-forming sugar (e.g., 2-deoxy-p-D-ribofuranose, P-D-ribofuranose, a-D- 
arabinofiiranosides, a-2'-deoxyribofuranosides, 2 , ,3 , -dideoxy-3 ! -aminoribofuranosides, and 
locked sugars). For those embodiments in which the modified oligonucleotides have attached 
minor groove binders, reporter groups, etc., each of the R 2 and R 3 groups are meant to include 
functional groups suitable for attachment of linking groups and the additional functional 
component (MGB, fluorophore, quencher, and the like). Such functional groups include, for 
example, hydroxy groups, amino groups, carboxylic acid or ester groups, phosphoric, 
phosphonic, or phosphinic acid or ester groups, sulfonic acid and ester groups, and the like. 
In other embodiments, MGBs, reporter groups and the like are attached to any of the interior 
base/backbone groups using conventional methodology. 
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In a related aspect, the present invention provides modified oligonucleotides 
comprising at least one 5-substituted pyrimidine base and at least one unsubstituted or 3- 
substituted pyrazo!o[3,4-d]pyrimidine base. The unsubstituted or 3-substituted pyrazolo[3,4- 
d]pyrimidine bases useful in this aspect of the invention are essentially those that have been 
described above (see formulae la and lb). A variety of 5-substituted pyrimidine bases are 
useful including those described in U.S. Patent Nos. 5,645,985 and 5,484,908. 

In preferred embodiments, the 5-substituted pyrimidine base has the formula: 

X 3 




Ha 

and the unsubstituted or 3-substituted pvrazolo[3,4-d]pyrimidine base has the formula: 

f R 1 

lb 

wherein each of the X 1 , X 2 and X 3 groups is independently selected from H, OH, NH 2 and a 
protected amino group; and each of the R 1 and R 4 groups is independently selected from (C r 
Ci 2 )heteroalkyl, (C 2 -C, 2 )heteroalkenyl, (C 2 -Ci 2 )heteroalkynyl, -0-(C r Ci 2 )alkyl, -0-(C 2 - 
C, 2 )alkenyl, -0-(C 2 -C, 2 )alkynyl, -S-(C,-Ci 2 )alkyl, -S-(C 2 -C 12 )alkenyl, -S-(C 2 -C, 2 )alkynyl, 
heterocyclyl(C,-C l2 )alkyl, heterocyclyl(C 2 -Ci 2 )alkenyl, heterocyclyl(C 2 -C, 2 )alkynyl, aryl(C,- 
Ci 2 )alkyl, aryl(C 2 -Ci 2 )alkenyl, aryl(C 2 -C l2 )alkynyl, aryl, heterocyclyl, halogen, -CN, 
-CONH 2 and protected forms thereof, with the additional feature that R 1 can also be H. 

In particularly preferred embodiments, the 5-substituted pyrimidine base is 

selected from: 




lib He 
wherein R 4 is preferably selected from propynyl, hydroxypropynyl, aminopropynyl, 
hydroxybutynyl or a substituted or unsubstituted phenyl, thienyl, thiazolyl, imidazolyl, 
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furanyl, oxazolyl, pyridinyl, pyrrolyl, indolyl, benzimidazolyl, benzthiazolyl or benzoxazolyl 
group attached directly to the pyrimidine ring or attached to the ring via a one to four carbon 
linking group which can be saturated (e.g., ethylene, propylene, butylene) or unsaturated 
(e.g., acetylenyl, propynylene, butynylene, propenylene, butenylene). 

In other preferred embodiments, the pyrazolo[3,4-d]pyrimidine base is 
selected from the preferred bases provided above (Ic, Id and Ie). Additionally, suitable bases 
are also depicted in Figure 1. 

As with the earlier aspect of the invention wherein the modified 
oligonucleotides comprise at least two unsubstituted or 3-substituted pyrazolo[3,4- 
d]pyrimidines, this aspect of the invention can similarly comprise additional groups such as 
MGBs and reporter groups (e.g., fluorophores, quenchers and the like) as well as linking 
groups suitable for the attachment of these additional components. 

Also preferred are those embodiments in which the modified oligonucleotide 
is represented by the formula: 

B 




wherein R 2 represents a first end of the modified oligonucleotide; R 3 represents a second end 
of the modified oligonucleotide; the subscript n is an integer of from 4 to 70; each B is 
independently selected from adenine, thymine, cytosine, guanine, uracil, a 5-substituted 
pyrimidine, an unsubstituted pyrazolo[3,4-d]pyrimidine and a 3-substituted pyrazolo[3,4- 
d]pyrimidine (with the proviso that at least one base is a 5-substituted pyrimidine and at least 
one base is a pyrazolo[3,4-d]pyrimidine); and each M is an oligomer-forming sugar or a 
peptide-nucleic acid-forming amino acid. 

In yet another aspect of the invention, modified oligonucleotides are provided 
which comprise an attached minor groove binder and in which as few as one of the bases is 
replaced with a modified base. Surprisingly, the combination of MGBs and a single modified 
base (or optionally multiple modified bases) leads to modified oligonucleotides having 
particularly useful properties for mismatch discrimination as well as primer extesions and 
other utilities described below. This group of modified oligonucleotides preferably comprise 
from 4 to 70 bases, and an attached minor groove binder, wherein at least one pf the bases is 
replaced by a modified base selected from the group consisting of 5-substituted pyrimidines 
and unsubstituted or 3-substituted pyrazolo[3,4-d]pyrimidines. 
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In this aspect of the invention, the minor groove binders, 5-substituted 
pyrimidines and unsubstituted or 3-substituted pyrazolo[3,4-d]pyrimidines can be essentially 
any of those components described above. 



In one group of embodiments, the modified oligonucleotide has at least one 5- 
substituted pyrimidine, preferably having the formula: 

X 3 




WW 



Ha 

wherein X 3 is selected from H, NH 2 , OH and SH; and R 4 is selected from (d- 
Ci 2 )heteroalkyl, (C 2 -Ci 2 )heteroalkenyl, (C 2 -Ci 2 )heteroalkynyl, -0-(Ci-C 12 )alkyl, -0-(C 2 - 
Ci 2 )alkenyl, -0-(C 2 -Ci 2 )alkynyl, -S-(Ci-Ci2)alkyl, -S-(C 2 -C, 2 )alkenyl, -S-(C 2 -Ci 2 )alkynyl, 
heterocyclyl(Ci-Ci 2 )alkyl, heterocyclyl(C 2 -Ci 2 )alkenyl, heterocyclyl(C 2 -Ci 2 )alkynyl, aryl(Ci- 
Ci 2 )alkyl, aryl(C 2 -Ci 2 )alkenyl, aryl(C 2 -Ci 2 )alkynyl, aryl, heterocyclyl, halogen, -CN, 
-CONH 2 and protected forms thereof. In this group of embodiments, preferred aryl and 
heterocyclyl group (including those components of other groups, e.g., arylalkyl) are selected 
mom substituted and unsubstituted versions of phenyl, tolyl, pyridyl, thiazolyl, imidazolyl, 
furanyl, oxazolyl, thienyl, pyrrolyl, benzimidazolyl, benzoxazolyl, benzthiazolyl, indolyl, 
triazinyl, pyrimidinyl and naphthyl. Suitable substituents for these aryl and heterocyclyl 
groups are those provided in the general definitions above. 

More preferably, the modified oligonucleotides comprise at least one 5- 
substituted pyrimidine based group of Formula lid and lie, where the R 4 substituent group is 
selected from a substituted or unsubstituted alkyl group or alkynyl group. Preferably, R 4 is a 
hydroxypropynyl, hydroxybutynyl, aminopropynyl, aminobutynyl orpropynyl group, as well 
as the higher homologs (e.g., C 5 -C 8 ) of these alkynyl groups. Alternatively, the R 4 
substituent can be a 3-(hydroxymethyl)-4-hydroxy-l-butynyl group (described in more detail 
below). 
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In another group of preferred embodiments, the modified oligonucleotide has 
at least one modified base having the formula: 

f R 1 
lb 

wherein each of the X 1 and X 2 groups is independently selected from H, OH, NH 2 and a 
protected amino group; and R 1 is selected from H, (Ci-C 12 )heteroaUcyl, (C 2 - 
Ci 2 )heteroalkenyl, (C 2 -Ci 2 )heteroalkynyl, -0-(C r Ci 2 )alkyl, -0-(C 2 -Ci 2 )alkenyl, -0-(C 2 - 
Ci 2 )alkynyl, -S-(Ci-Ci 2 )alkyi, -S-(C 2 -Ci 2 )alkenyl, -S-(C 2 -C 12 )alkynyl, heterocyclyl(Ci- 
C l2 )alkyl, heterocyclyl(C 2 -Ci 2 )alkenyl, heterocyclyl(C 2 -Ci 2 )alkynyl, aryl(Ci-C I2 )alkyl, 
aryl(C 2 -Ci 2 )alkenyl, aryl(C r Ci 2 )alkynyl, aryl, heterocyclyl, halogen, -CN, -CONH 2 and 
protected forms thereof. In this group of embodiments, preferred aryl and heterocyclyl group 
(including those components of other groups, e.g., arylalkyl, heterocyclylalkyl) are selected 
from substituted and unsubstituted versions of phenyl, tolyl, pyridyl, thiazolyl, imidazolyl, 
furanyl, oxazolyl, thienyl, pyrrolyl, benzimidazolyl, benzoxazolyl, benzthiazolyl, indolyl, 
triazinyl, pyrimidinyl and naphthyl. Suitable substituents for these aryl and heterocyclyl 
groups are those provided in the general definitions above. 

In particularly preferred embodiments, the modified oligonucleotides comprise 
at least one monomer of formula If, 
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* N M 
O 

s II _ 

O" 




? < 
cr 

If 

wherein X 1 and X 2 are independently H, OH or NH 2 ; and R 1 is most preferably 3- 
hydroxypropyn-l-yl, propynyl, 3 -aminopropyn- 1 -yl, 4-hydroxy-l-butynyl, 3- 
(hydroxymethyl)-4-hydroxy-l-butynyl, halogen or 3,3,3-trifluoropropyn-l-yL Syntheses of 
some of these monomers have been reported (Balow et al., Nuc. Acid Res., 26:3350-3357 
(1998); Seela et al, /. Chem. Soc. Perkin Trans., 1, 479-488 (1999); Ramzaeva et al, Helv. 
Chim. Acta, 80:1809-1822 (1997)). 

Preparation of Modified Bases and Oligonucleotides 



Reaction Schemes 1-10 provide illustrative methods for preparing a number of 
modified bases (unsubstituted and 3-substituted pyrazolo[3,4-d]pyrimidines and 5-substituted 
pyrimidines) that are useful in the present invention. The schemes illustrate the preparation 
of phosphoramidite derivatives of the modified bases which can be used in, for example, 
automatic synthesizers for preparing the modified oligonucleotides of the invention. 

Reaction Scheme 1 illustrates the preparation of 5-(prop-2-ynyl-4- 
methylbenzoate)-5 , -0<4 > 4'-dimethoxytriphenylmethyl)-2'-deox 
N,N-diisopropylphosphoramidite] (3) starting from 5-iodo-2'-deoxyuridine. The starting 
materials was treated with prop-2-ynyl 4-methylbenzoate in the presence of Pd(PPh 3 ) 4 -CuI to 
yield the methylbenzoate derivative (1) which was then converted to the 5 '-blocked DMTr 
derivative (2) and then to the phosphoramidite (3). 



26 



WO 03/107144 



PCT/US03/19447 



Reaction Scheme 1 




Reaction Schemes 2 and 3 provide one method for the preparation of 
pyrazolo[3,4<]pyrimidine phosphoramidites. More particularly, these schemes illustrate the 
preparation of 3-[-4((lE)-l-aza-2-methylprop-l-enyl)-l-((2R,5R)-4- 
{|>is(memylemyl)ammo](2-cyanoethoxy)phosphinooxy} -5- {[bis(4- 

methoxvphenyl)phenylmemo X y]memyl}oxolan-2-yl)pyrazolo[3,4-d]pyrimidin-3-^ 
ylnyl 4-methylbenzoate (13; Ri=-OCOPhCH 3 ) is synthesized in two portions. 
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Reaction Scheme 2 




In the first portion (Reaction Scheme 2), (2R,5R)-5-(4-amino-3- 
iodopyrazolo[3,4-d]pyrimidinyl)-2-(hydroxymethyl)oxolan-3-^ (9) was synthesized starting 
5 from l,5-hydropyrazolo[3,4-d]pyrimidin-4-one. The starting material was treated with iodine 
monochloride and P2S5 successively to yield the iodo (4) and thione (5) derivatives 
respectively. The thione (5) was converted to the ethylthio-derivative (6), which was 
combined with a l-chloro-l,2-dideoxy-3,5-di»0-toluoylribofiiranose derivative to yield the 
blocked nucleoside (7). Reaction of compound (7) with sodium methoxide followed by 
10 ammonium hydroxide gave hydroxymethyl derivative (8) and compound (9) respectively. 

In the second portion (Reaction Scheme 3), 2, 3-[-4((lE)-l-aza-2-methylprop- 
1 -enyl)4-((2R,5R)-4- {[bis(methylethyl)amino](2-cyanoethoxy)phosphinooxy 
methoxyphenyl)phenylme&oxy]me 

ylnyl 4-methylbenzoate (13 Ri=-OCOPhCH 3 ) was prepared starting from compound (9). 
1 5 Compound (9) was reacted with prop-2-ynyl 4-methylbenzoate in the presence of Pd(PPh 3 ) 4 - 
Cul to yield the prop-2-ynyl derivative (10; Ri=-OCOPhCH 3 ). The amino group in this 
compound was protected by reaction with N,N dimethylacetamide dimethyl acetal to give 
(ll;Ri=-OCOPhCH 3 ). Compound (11; R^OCOPhCHb) was converted to the DMTr 
derivative (12; Ri=-OCOPhCH 3 ) and then to the phosphoramidite (13; Ri=-OCOPhCH 3 ). 
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Reaction Scheme 3 




Ri=Hor-OCOPhCHj 

Reaction Scheme 4 illustrates the preparation of N-{3-[l-((2R5R)-5-{bis(4- 
memoxyphenyl)phenylmemoxy]me%l}-4-{[bis(memylemyl)ammo](2- 
cyanoemoxy)phosphmooxy}oxolan-2-yl)-6-am™ 
3-yl)]propyl} -2,2,2-trifluoroacetamide (22). 



29 



WO 03/107144 



PCT/US03/19447 



OCH 3 




H 2 N^N^N 



ia 



Reaction Scheme 4 

OCH3 , 

1. K0H/H 2 0 



N 




H 2 N^N^N 



2-TolOCO^v 0 



OCH3 1 
H 2 N^N^> 



14 



V-°S TolOCO— <^0j 
OCOTol 15 QCOTol 



-NHCOCF3 



NaOMe/MeOH 



V 



HaN N h/'O 



HN '-Np, N ^ HCOCFa H N' 

OH OH 

17 



OH 18 



MeOH 



16 



Pd-OH, H 2 (g) 

-NHCOCF3 



-NHCOCF3 




HN 

H 2 N^N^N 
HO-^OJ 



(MeO^CHNMej 



DMF 



Me 2 N^N^N w 
HO- 



19 



OH 



(H 3 C) 2 N- 

DMTrO- 




NHCOCF3 



J N N 
OH 

20 

DMTrCI/Pyridine 



O \ 1 — NHCOCF3 

Ai^N^N C.P[Q(CH^CNIIN(iPr) 2l H HN^Nj-^i 

MTrO— v^oJ QHjNOPrh (H 3 C) 2 N-^ N ^ N -^N 



w 



22 



0(CH 2 ) 2 CN 



DMTiO-^s^O^I 



21 OH 



In Reaction Scheme 4, 6-amino-4-methoxypyrazolo[3,4-d]pyrimidine was 
5 converted to the iodo derivative (14) which was treated with l-chloro-l,2-dideoxy-3,5-di-0- 
toluoylribofuranose to yield the nucleoside (15). Compound (15) was treated with 
NaOMe/MeOH to hydrolyze the toluoyl groups and yield the unblocked nucleoside (16). 
Treatment of (16) with aqueous sodium hydroxide gave (17) which could be converted to the 
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trifluoro-N-prop-2-ynyl acetamide derivative (18). Compound (18) was reduced with 
hydrogen and a Pd catalyst to yield the trifluoro-N-propylacetamide derivative (19). The 4- 
amino group of compound (19) was protected by treatment with HN-dimethylformamide 
dimethylacetal to give compound (20) which was converted to DMTr derivative (21) and 
5 then to the phosphoramidite derivative (22). 

Compounds of Formula 4 (3-[((2R,5R)-5-(6-amino-4-oxo-3-prop-l-ynyl(5- 
hydropyrazolo[3,4-d]pyrimidinyl))-2-{[bis(4-methoxyphenyl)phenylmetho 
methyl} oxolan-3-yloxy)[bis(methylethyl)amino]phosphinooxy]propanenitrile) (PPPG 
phosphoramidite) can be synthesized using some of the reactions in Reaction Scheme 4. 



10 




Formula 4 Formula 5 

In the conversion of (17) to (18), prop-l-yne is used in place of 2,2,2-trifluoro- 
N-prop-2-ynylacetamide. The 6-amino group can be protected as described for compound 
(20) and the S'-hydroxyl can be protected with a DMTr group as described for compound 
1 5 (21). Finally the phosphoramidite can be prepared as described for compound (22) to yield 
the compound of Formula 4. In a similar way compounds of Formula 5 where R 6 is 
-OCOPhCH 3 or -NHCOCF3. can be prepared using related reactions known in the art. 

Reaction Scheme 5 illustrates the preparation of 3-{[5-(4,6-bis{(lE)-l-aza-2- 
[bis(2-methylpropyl)amino] vinyl} -3 -prop- 1 -ynylpyrazolo [3 ,4-d]pyrimidinyl)-2- { [bis(4- 
20 methoxyphenyl)phenylmethoxy] -methyl} oxolan-3-yloxy] [ethyl(methylethyl)amino } - 
phosphino}propanenitrile (26). 
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Reaction Scheme 5 




26 

Compound (26) can be synthesized starting from (NH 2 ) 2 PPPA (23) (see, Seela 
& Driller, Helv. Chim. Acta 71:757-761(1988)). Compound (23) can be converted to the 
5 bis(methylethyl)amino derivative (24) (Vincent et al, /. Org, Chem., 64:991-997 (1999)), 
followed by reaction first with DMTrCl to yield (25) that can be converted to the 
phosphoramidite (26). 

Reaction Scheme 6 provides the synthesis of protected 3-substituted 
pyrazolo[3 9 4-d]pyrimidines in which the substituent is a heteroalkyl group. 
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Reaction Scheme 6 




Reaction Scheme 7 provides the synthesis of protected 3 -substituted 



pyrazolo[3,4-d]pyrimidine in which the substituent is a methoxypropynyl group. The 3-iodo 
5 diamino derivative (23) was reacted with Pd(PPh 3 )4-CuI, triethylamine in anhydrous DMF 
followed by methyl propargyl ether to yield the 3-methoxypropynyl-2,4- 
diaminopyrazolo[3,4-d]pyrimidine derivative (27). The amino groups in this compound were 
blocked by reaction with N,N-dimethylformamide dimethylacetal to yield (28). The blocked 
nucleoside was first reacted with dimethoxytrityl chloride and then with 2-cyanoethyl 
10 diisopropylchlorophosphoramidite to yield the desired blocked phosphoramidite (29). 
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Reaction Scheme 7 
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The following reaction schemes provide methods for the preparation of 3- 
heterocycle substituted pyrazolo[3 3 4-d]pyrunidines. The general methods provided herein 
5 can be adapted for the preparation of other heterocyclic substituents. 
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Reaction Scheme 8 




Reaction Scheme 8 provides the synthesis of protected 3-substituted 
pyrazolo[3,4-d]pyrimidine in which the substituent is a furanyl group. Malonitrile was 
5 reacted with 2-furfiiryl chloride in the presence of a base, followed by treatment with 

dimethylsulfide to give the methoxy dinitrile derivative (30). Reaction of (30) with hydrazine 
yielded the substituted pyrazole (31) which was reacted with formamide to give 3-(2- 
fiuyl)pyrazolo[3,4-d]pyrimidine-4-ylamine (32). Base (32) was treated with l-chloro-1,2- 
dideoxy-3,5-di-0~toluoylribofuranose to yield the blocked nucleoside (33). Compound (33) 
10 was first treated with NaOMe/MeOH to hydrolyze the toluoyl groups to yield the unblocked 
nucleoside which was reacted with N,N-dimethylformamide dimethylacetal to yield the 
protected nucleoside derivative (34). This derivative was first reacted with dimethoxytrityl 
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chloride and then with 2-cyanoethyl diisopropylchlorophosphoramidite to yield the desired 
blocked phosphoramidite (35). 



Reaction Scheme 9 

i) koh/dmf/cHjCN 



5 




Reaction Scheme 9 provides the synthesis of protected 3-substituted 2,4- 
diamino pyrazolo[3,4-d]pyrimidine in which the substituent is a furanyl group. The 
carbonitrile (31) was reacted with guanidinium carbonate to yield 3-(2-furyl)pyrazolo[3,4- 
d]pyrimidine-4,6-diamine (36). This base was treated with l-chloro-l,2-dideoxy-3,5-di-0- 

1 0 toluoylribofiiranose to yield the blocked nucleoside which was treated with NaOMe/MeOH 
to hydrolyze the toluoyl groups and yield the unblocked nucleoside derivative (37). The 
latter compound was reacted with N,N-dimethylformamide dimethylacetal to yield the 
protected nucleoside derivative (38). This derivative was first reacted with dimethoxytrityl 
chloride and then with 2-cyanoethyl diisopropylchlorophosphoramidite to yield the desired 

1 5 blocked phosphoramidite (39). 
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Reaction Scheme 10 




Reaction Scheme 10 provides one synthesis route to a hydroxy-protected 
HOBU-phosphoramidite. In this scheme, 3-butyn-l-ol is converted to its /Moluoyl ester (40) 
5 with p-toluoyl chloride in the presence of pyridine. The protected butynol is combined with 
5-iodo-2'-deoxyuridine in the presence of tetrakis(triphenylphosphine)palladium(0) and 
copper iodide, then protected (as a DMT ether) to form 41, which can then be converted to its 
corresponding phosphoramidite derivative 42. Reagent 42 can be used directly in the 
preparation of modified oligonucleotides. Removal of the /?-toluoyl protected group can be 
1 0 accomplished according to standard methods. 

In each of the above schemes, one of skill in the art will understand that other 
protecting groups and/or activating groups can be used. Additionally, different non-natural 
base-containing monomers, with different functional groups, can require different blocking 
groups for successful synthesis of the modified oligonucleotides. A variety of useful 
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protection groups, their synthesis and de-protection methods are described in, for example, 
Beaucage & Iyer, Tetrahedron 48:2223-231 1 (1992). 

The synthesis of oligonucleotides, and modified oligonucleotides can be 
initiated from a solid support containing a cleavable linker to which is attached the first base. 
The oligonucleotides of the invention can be synthesized containing a substituted 
pyrazolo[3,4-d]pyrimidine as the first nucleotide at the 3'-end, using an appropriate 3- 
substituted pyrazolo[3,4-d]pyrimidine. Alternatively, a 5-substituted pyrimidine can be 
attached initially to a solid support. Reaction Scheme 1 1 illustrates a general approach to the 
synthesis of a protected 3-substituted pyrazolo[3,4-d]pyrimidine dA CPG derivative (C). 
Intermediate (A) can be prepared using procedures described in Reaction Scheme 3, while 
(B) and (C) can be synthesized as described in co-pending U.S. application Ser. No. 
09/457,616. 



Reaction Scheme 11 
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In this aspect of the invention, a variety of solid supports are useful, provided 
the support is compatible with automated oligonucleotide techniques and include, glass, 
polystyrene, nylon, plastic and the like. Additonally, the present invention provides in a 

5 related aspect, a solid support (e.g., controlled pore glass (CPG)) having an attached modified 
base, preferably those of formulae Ic, Id, Ie, If, lib, He, lid, lie. More preferably, the attached 
modified base is a 3-substituted pyrazolo[3,4-d]pyrimidine or a 5-substituted pyrimidine 
moiety wherein the substituents are selected from the preferred groups described above, and 
further including the novel bases described in more detail below. 

1 0 The present invention further includes those compositions and methods 

wherein any of the unsubstituted and/or 3-substituted pyrazolo[3,4-d]pyrimidines and/or 5- 
substituted pyrimidines are combined or used in combination with other modified bases 
known in the art. Other modified modified monomeric units have been disclosed Scheit, 
Nucleotide Analogs, John Wiley, New York, 1980; Uhlman and Meyman, Chemical 

15 Reviews, 90:543-584 (1990), Seela and Debelak, Nucl Acids Res., 28:3224-3232 (2000); 
Balow et al, Nucl. Acids Res., 26:3350-3357 (1998); Bolli et al Nucl Acids Res., 24:4660- 
4667 (1996). 

Modified Oligonucleotide Arrays 

20 

In another embodiment of the present invention, modified oligonucleotides are 
used in procedures which utilize arrays of oligonucleotides, such as sequencing by 
hybridization and array-based analysis of gene expression. A variety of arrays are 
contemplated by the present invention including, for example, chip or platform arrays, bead 

25 arrays, liquid phase arrays, 'zip-code* arrays and the like. In sequencing by hybridization, an 
ordered array of oligonucleotides of different known sequences is used as a platform for 
hybridization to one or more test polynucleotides, nucleic acids or nucleic acid populations. 
Determination of the oligonucleotides which are hybridized and alignment of their known 
sequences allows reconstruction of the sequence of the test polynucleotide. Alternatively, 

30 oligonucleotides comprising the wild-type sequence and all possible mutant sequences for a 
given region of a gene of interest can be placed on an array. Exposure of the array to DNA or 
RNA from a subject or biological specimen, under hybridization conditions, allows 
determination of wild-type or mutant status for the gene of interest. See, for example, U.S. 
Patent Nos. 5,492,806; 5,525,464; 5,556,752; and PCT Publications WO 92/10588 and WO 
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96/17957. Both of these techniques require discrimination between related sequences, 
especially at the single-nucleotide level; hence, the enhanced discriminatory properties of the 
modified oligonucleotides of the invention will provide improvements in these techniques. 
Materials for construction of arrays include, but are not limited to, nitrocellulose, glass, 
silicon wafers, optical fibers and other materials suitable for construction of arrays such as 
are known to those of skill in the art. The synthesis of oligonucleotides arrays has been 
described in co-pending U.S. application Ser. No. 09/364,320, and suitable modifications can 
be made to the methods therein for preparing the arrays of the present invention. 

An additional application of the present invention to array technology is in the 
examination of patterns of gene expression in a particular cell or tissue. In this case, 
oligonucleotides or polynucleotides corresponding to different genes are arrayed on a surface, 
and a nucleic acid sample from a particular cell or tissue type, for example, is incubated with 
the array under hybridization conditions. Detection of the sites on the array at which 
hybridization occurs allows one to determine which oligonucleotides have hybridized, and 
hence which genes are active in the particular cell or tissue from which the sample was 
derived. 

Array methods can also be used for identification of mutations or 
polymorphisms, where wild-type and mutant sequences are placed in an ordered array on a 
surface. Hybridization of a polynucleotide sample to the array under stringent conditions, 
and determination of which oligonucleotides in the array hybridize to the polynucleotide, 
allows determination of whether the polynucleotide possesses the wild-type or the mutant 
sequence. Since many mutant sequences of clinically-relevant genes differ from their wild- 
type counterpart at only one or a few nucleotide positions, the enhanced discriminatory 
powers of the modified oligonucleotides of the invention will provide improvements in 
mutation detection. 

In all of the above-mentioned applications of array technology, the increased 
discriminatory abilities of modified oligonucleotide provide significant improvements in 
sensitivity and resolving power. 

Efficient hybridization, for example in arrays, requires that the capture probes 
contain T m 's in a narrow temperature range. In one embodiment of the invention the T m of 
capture oligonucleotides for use in arrays is balanced, eliminating the problem associated 
with GC and AT-rich sequences, by using a combination of 3'-MGBs and modified bases 
(e.g., PPPA and PPPU). Table 1 and Figure 2 illustrate the balancing of the T m of 
pyrimidine rich probes with different G/C and A/T base pairs. To balance the T m of the 
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different probes, PPPA, PU and MGB were used in combination. The purine and pyrimidine- 
rich probes and complementary targets are shown in Table 1 and the T m s of these probes are 
shown in Figure 2. As shown in Figure 2 the T m difference between a GC-rich and an AT- 
rich probe can be more than 50°C. 

5 

Table 1 

Sequences of Oligonucleotide Probes and Complementary Targets 



base 

Complementary Targets py/p u . rich probes pairs 

1* TCGGCGGCGT 1\MGB-Q-CGCCGCCG 8 G/C 

2* ACAGCGGCGT 2* MGB-Q-CGCCGCTG 7 G/C, 1 A/T 

3* ACAGCGACGT 3*.MGB-Q-CGTCGCTG 6 G/C, 2 A/T 

4* TCAGTGACGA 4*.MGB-Q-CGTCACTG 5 G/C, 3 A/T 

5* TCAGTGACAA 5*.MGB-Q-TGTCACTG 4 G/C, 4 A/T 

6* TCAATGACAG 6*.MGB-Q-TGTCATTG 3 G/C, 5 A/T 

7* ACAATGATAA 7*.MGB-Q-TATCATTG 2 G/C, 6 A/T 

8* CCAATAATAA 8*.MGB-Q-TATTATTG 1G/C.7A/T 

9* GTAATAATAA 9*.MGB-Q-TATTATTA 8 A/T 

10 As can be seen in Figure 2, modification with PPPA, PU and a MGB provides 

a probe set having balanced T ra s. 

Efficient hybridization for example in arrays, requires that the probes contain 
T m 's in a narrow temperature range. Thus, in related embodiments, the modified 
oligonucleotides described herein can comprise any combination of normal bases, 

15 unsubstituted pyrazolo[3,4-d]pyrimidine bases (e.g., PPG and PPA), 3-substituted 
pyrazolo[3,4-d]pyrimidines, modified purine, modified pyrimidine, 5-substituted 
pyrimidines, universal bases and a minor groove binder to balance the T m ofthe 
oligonucleotide. In certain aspects, the modified oligonucleotides include one or more 
modified bases, and the nearest neighbor parameters and T m determination are preferably 

20 determined based on the inclusion of at least one modified base. Examples of modified bases 
include a base attached to an amino acid, a locked nucleic acid sugar with a normal base, a 
PNA backbone with a normal base, and any other modified base as described herein or any 
other modified bases as are apparent to one skilled in the art. In some aspects, the modified 
oligonucleotides include one or more of a quencher, a fluorophore and a minor groove binder 

25 attached thereto. 



41 



WO 03/107144 



PCT/US03/19447 



The ability to predict and "level" the T m s of multiple modified 
oligonucleotides is particularly useful for the development of oligonucleotide arrays or for 
compositions having a plurality of oligonucleotides. Still further, the ability to predict T m s 
for modified oligonucleotides is useful for establishing appropriate conditions for 
hybridization, renaturation, mapping variations of base compositions of sequences, 
determination of sequence complexity and divergence. A T m prediction algorithm can be 
developed using models that assign thermodynamic contributions to duplex formation for all 
possible individual base pair nearest neighbors. Thermodynamic parameters for 10 
independent nearest-neighbors of base pairs are available from published sources for 
DNArRNA hybrids (see Sugimoto et aL, Biochem. 34:1 121 1-11216 (1995)) and dsDNA 
(SantaLucia et al, Biochem. 36:10581-10595 (1997)). The use of the nearest neighbor 
parameters to develop programs to calculate oligonucleotide duplex stability in terms of T m 
have been described (see, Schutz & von Ahsen, Biotechniques, 27:1218-22 (1999); Owczarzy 
et al, Biopolymers, 44:217-239 (1997); for PNA/DNA, see Geisen et al, Nucl Acids Res., 
26:5004-5006 (1998); Blake et al, Bioinformatics, 15:370-5 (1999) and U.S. Patent No. 
5,556,749. 

The basic principles of thermodynamic and kinetic description of mismatch 
discrimination are well known and are summarized below. 
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A probe-target interaction is an equilibrium reaction that can be described as 
A + B <=> AB where A = probe, B = DNA target 

5 

At the temperature T equilibrium of forward (association) and reverse 
(dissociation) reactions can be described by equilibrium constant K where : 

[AB] 

10 K= ; 



fAJ/BJ 



in terms of equilibrium thermodynamics 

15 

K= exp(- AH°ab /RT + AS° AB /R); 
ACT AB = AH°ab — TAS° ab y 
AG* ab— -RTLn(K) 

20 where : AH°ab , AS°ab and AG°ab - enthalpy, entropy and free energy changes for duplex 
AB formation. 

The duplex AB has N base pairs (N-l nearest neighbors) and the nearest- 
neighbor approximation assumes 

25 AH° AB = AH° 1 +AH° 2 +... AH° n .j +AH° init =Z(AH°i +AH° init ) 

AS°ab — AS°i + AS° 2 +... AS° n .j ^ASTmu =Z(AS?i +AS° init ) 
ACT ab^ ACT j +AG° 2 +... A(f n -i +AG? init =Z(AGr t +A€? in u) 

where : AH°i , AS* and AG°i - enthalpy, entropy and free energy changes for each nearest- 
30 neighbor base pairs increment for duplex AB formation, AH°i n it , AS° in it and AG° in it - 
thermodynamic parameters of initiation of duplex AB formation. 

In the case of total concentration equality, Cprobe= C larg et , an equation for Tm 
in terms of free energy, enthalpy and entropy can be arrived at as shown below: 

35 ZCAET^AH 0 ^) 

TM= 

£ (AS°i + AS°init ) + RLn[(C p robe+C t arged/4] 

Similarly as shown above relationships can be developed when the probe 
40 hybridizes to a target that contains a mismatch, as shown below: 
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A + B* o AB* A = probe, B* = mismatched DNA target 

Thermodynamic parameters for mismatched duplex AB* are: K*, AH°ab* 
, AS°ab* and AG° A b* - In terms of nearest-neighbor parameters for a mismatched base pair 
at position "4", for example, involving nearest-neighbor "3" and "4" the discrimination 
parameters AAH° , AAS° , AAG° can be expressed as: 

AAH°=AH° AB -AH° AB * = (AH° 3 - AH°* 3 ) + (AH° 4 - AH°* 4 ) = AAH° 3 + AAH° 4 . 
AAS° ~ AS°ab -AS° AB * = (AST 3 - AST* S ) + (AST 4 • AS°* 4 ) = AAS° 3 + AAS° 4 . 
AAG° = A(? AB - AG°ab* = (AG? 3 - AG?* 3 ) + (A<? 4 - AG°* 4 ) - AAG° 3 + AAG° 4 . 

In one embodiment of the invention, probe sequences are designed to 
determine the thermodynamic nearest-neighbor parameters for oligonucleotides containing 
normal purines, pyrimidines and a minor groove binder CDPI3 attached to the 3 '-end of an 
oligonucleotide. In another embodiment thermodynamic nearest-neighbor parameters for 
oligonucleotide containing normal pyrimidines, purines, a 3 '-end attached CDPI 3 , but 
wherein the guanine is replaced with 5-hy<kopyrazolo[3,4-d]pyrimin-4-one. These nearest- 
neighbor thermodynamic parameters are used in these two embodiments to calculate the T m s 
of these types of oligonucleotides with and without CDPI3. In the case where the CDPI3 was 
attached at base 1 at the 3 '-end the new T m prediction algorithm and formula were used as 
derived below. 

T M - 

E(AST i +ASr in »)+RLn[(C probe +C ta ^ 

The model assumes that a MGB attached to an oligonucleotide has an 
additional pure entropic factor AAS°mgb. This value can be calculated according to - 
algorithm : 6 bases (5 nearest-neighbor ) from 3 '-prime end are covered by the MGB from 
starting base "1" or "2". Each nearest-neighbors covered has unique value AAS 0 jMGB . 
Correction factor "A" are introduced in the case when adenosine can be found in base 
positions 6, 7 or 8. Statistical factor "n" is introduced additionally to factor "A" were n=3 for 
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-AAA-; n=2 for -AA-, -ANA-; n=l for -A-; n=0 for -AN- and -NA- (N is any base other than 
A). A final value AAS° M gb can be determined by the equation 

mgb = ZAAS? mG n + A(optional) + R*Ln(N) (optional) 

Tables 2a and 2b contain the nearest-neighbor thermodynamic parameters for 
oligonucleotides containing PPG alone and the entropic contribution of the MGB, 
respectively. In the latter case the contribution of fluorescein and a dark quencher is 
incorporated in the thermodynamic treatment and has generally been shown to be negligible. 
The quencher is part of the linker between the MGB and the oligonucleotide. To those 
skilled in the art it will be appreciated that when a group is covalently attached to the 
oligonucleotide, or oligopolymer, the nearest-neighbor parameters may be influenced in the 
area where the group is attached. Preferably the nearest neighbor parameters are incluenced 
within 2 bases, although influence as far as 6 bases or more may occur. For example, where 
a minor groove binder spans from 5 to 6 bases in a duplex, the nearest neighbor parameters 
may be influenced within 6 bases due to the minor groove binder. 

Tables 2a and 2b 

a) Nearest-neighbor parameters for PPG containing oligonucleotides 
b) The entropic contribution of the CDPI 3 . 



a) 



Number 




AH 0 


AS° 


AG°{65) 


1 


AA 


-7850 


-22.3 


-324 






cal/mol 


caUM/E 0 


cal/M 


2 


AT 


-8180 


-23.0 


-411 


3 


AC 


-8450 


-22.6 


-795 


4 


AG 


•6560 


-17.2 


-730 


5 


TA 


-7230 


-21.8 


129 


6 


TT 


-7850 


-22.3 


-324 


7 


TC 


-8720 


-24.0 


-614 


8 


TG 


-6900 


-18.3 


-715 


9 


CA 


7530 


-20.3 


-652 


10 


CT 


-6390 


-17.3 


-548 


11 


CC 


-7860 


-20.1 


-1071 


12 


CG 


-6030 


-14.0 


-1288 


13 


GA 


-10070 


-27.4 


-795 


14 


GT 


-9110 


-23.5 


-1175 


15 


GC 


-13170 


-34.0 


-1665 


16 


GG 


-8080 


-20.2 


-1258 


17 


GCinit 


-48440 


-17.6 


1116 


18 


ATinit 


-1060 


-7.1 


1341 



45 



WO 03/107144 



PCT/US03/19447 









Number 


5'-3' 


AAS° 






cal/M/K° 


1 


AA 


3.408 


2 


AT 


3.060 


3 


AC 


1.442 


4 


AG 


0.750 


5 


TA 


2.463 


6 


TT 


3.313 


7 


TC 


2,870 


8 


TG 


0.893 


9 


CA 


0.607 


10 


CT 


2.253 


11 


CC 


0.905 


12 


CG 


-0.721 


13 


GA 


1.282 


14 


GT 


2.397 


15 


GC 


2.172 


16 


GG 


1.298 



In one preferred embodiment T m s of multiple modified oligonucleotides containing the same 
number of bases are leveled using an algorithm to select nearest neighbor parameters from 
any combination of normal bases, universal bases, PPA, PPG, PPPA, PPPG, PU, PC, HOPU, 
5 HOBuU, HOBuC, (NH 2 ) 2 PPPA, (NH 2 ) 2 PPPAOH, (NH 2 ) 2 BuPPAOH, (NH 2 ) 2 PPAI, 

HOBuPPG, (NH 2 ) 2 PPAI, (NH 2 ) 2 PPABr, (NH 2 ) 2 PPAC1), PP AI, PPABr, PPAC1, bases having 
a minor groove binder, a fluorophore, a quencher, a modified sugar, a modified backbone and 
a chemilumenescer. As indicated in Tables 2a and 2b, the calculation of nearest neighbor 
parameters involves the measurement of the thermodynamic parameters, e.g., enthalpy (AH 0 ) 

10 and enthropy (AS 0 ), that influence the free energy change (AG 0 ) of the groups assembled in 
the oligonucleotide, oligomer or chimeria. These include groups such as minor groove 
binders, fluorophores, quenchers and the like. 

Alternatively, the T m s of multiple modified oligonucleotides are substantially 
leveled for containing the same number of bases plus or minus 1 or 2 bases, with a Tm range 

15 of about ±2°C. 

Alternatively, the T m s of an array of oligonucleotides, oligomers or chimeras 
are substantially balanced, when one or more base, backbone or sugar is changed to achieve 
the leveling. 

In some instances, modified bases are used that improve duplex stability in 
20 addition to those modified bases that decrease duplex stability. Modified bases that decrease 
duplex stability are well known, e.g. 7-deazaadenine and 7-deazaguanine. 
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Use of Modified Oligonucleotides 

The modified oligonucleotides of the present invention provides numerous 
5 advantages over unmodified oligonucleotides, including superior mismatch discrimination. 
The modified oligonucleotides of the invention are particularly useful as probes, wherein 
their hybridization to a target sequence is detected, or as primers, wherein their hybridization 
to a target sequence is followed by polynucleotide synthesis initiated from the 3' terminus of 
the modified oligonucleotide, and the synthesized product (le. , the extension product) is 
10 detected 

The modified oligonucleotides of the present invention are useful in other 
techniques in which hybridization of an oligonucleotide to another nucleic acid is involved. 
These include, but are not limited to, techniques in which hybridization of an oligonucleotide 
to a target nucleic acid is the endpoint; techniques in which hybridization of one or more 

1 5 oligonucleotides to a target nucleic acid precedes one or more polymerase-mediated 

elongation steps which use the oligonucleotide as a primer and the target nucleic acid as a 
template; techniques in which hybridization of an oligonucleotide to a target nucleic acid is 
used to block extension of another primer; techniques in which hybridization of an 
oligonucleotide to a target nucleic acid is followed by hydrolysis of the oligonucleotide to 

20 release an attached label; and techniques in which two or more oligonucleotides are 

hybridized to a target nucleic acid and interactions between the multiple oligonucleotides are 
measured. Conditions for hybridization of oligonucleotides, and factors which influence the 
degree and specificity of hybridization, such as temperature, ionic strength and solvent 
composition, are well-known to those of skill in the art. See, for example, Sambrook et al, 

25 supra; Ausubel, et al, supra; M.A. Innis et al (eds.) PCR Protocols, Academic Press, San 
Diego, 1990; B.D. Hames et al (eds.) Nucleic Acid Hybridisation: A Practical Approach, 
IRL Press, Oxford, 1985; and van Ness etal (1991) Nucleic Acids Res. 19:5143-5151. 

Hybridization of probes and/or primers to target sequences proceeds according 
to well-known and art-recognized base-pairing properties, such that adenine base-pairs with 

30 thymine or uracil, and guanine base-pairs with cytosine. The property of a nucleotide that 
allows it to base-pair with a second nucleotide is called complementarity. Thus, adenine is 
complementary to both thymine and uracil, and vice versa; similarly, guanine is 
complementary to cytosine and vice versa. An oligonucleotide which is complementary 
along its entire length with a target sequence is said to be perfectly complementary, perfectly 
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matched, or folly complementary to the target sequence, and vice versa. An oligonucleotide 
and its target sequence can have related sequences, wherein the majority of bases in the two 
sequences are complementary, but one or more bases are noncomplementary, or mismatched. 
In such a case, the sequences can be said to be substantially complementary to one another. 
5 If the sequences of an oligonucleotide and a target sequence are such that they are 

complementary at all nucleotide positions except one, the oligonucleotide and the target 
sequence have a single nucleotide mismatch with respect to each other. 

The modified pyrazolo[3,4-d]pyrimidine nucleotides of the invention retain 
the base-pairing specificity of their naturally-occurring analogues; PPPG analogues are 
10 complementary to cytosine, while PPPA analogues are complementary to thymine and uracil. 
The PPPG and PPPA analogues not only have a reduced tendency for so-called 'Nvobble" 
pairing with non-complementary bases, compared to guanine and adenine, but the 3- 
substituted groups increase binding affinity in duplexes. Similarly, modified pyrimidines 
hybridize specifically to their naturally occurring counter partners, 
1 5 Conditions for hybridization are well-known to those of skill in the art and can 

be varied within relatively wide limits. Hybridization stringency refers to the degree to 
which hybridization conditions disfavor the formation of hybrids containing mismatched 
nucleotides, thereby promoting the formation of perfectly matched hybrids or hybrids 
containing fewer mismatches; with higher stringency correlated with a lower tolerance for 
20 mismatched hybrids. Factors that affect the stringency of hybridization include, but are not 
limited to, temperature, pH, ionic strength, concentration of organic solvents such as 
formamide and dimethylsulfoxide and chaotropes. As is well known to those of skill in the 
art, hybridization stringency is increased by higher temperatures, lower ionic strengths, and 
lower solvent concentrations. See, for example, Ausubel et al, supra; Sambrook et aL, 
25 supra; M.A. Innis et al (eds.) PCR Protocols, Academic Press, San Diego, 1990; B.D. 
Hames et al (eds.) Nucleic Acid Hybridisation: A Practical Approach, IRL Press, Oxford, 
1985; and van Ness a/., (1991) 19:5143-5151. 

Thus, in the formation of hybrids (duplexes) between an oligonucleotide and 
its target sequence, the oligonucleotide is incubated in solution, together with a 
30 polynucleotide containing the target sequence, under conditions of temperature, ionic 
strength, pH, etc, that are favorable to hybridization, i.<?., under hybridization conditions. 
Hybridization conditions are chosen, in some circumstances, to favor hybridization between 
two nucleic acids having perfectly-matched sequences, as compared to a pair of nucleic acids 
having one or more mismatches in the hybridizing sequence. In other circumstances, 
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hybridization conditions are chosen to allow hybridization between mismatched sequences, 
favoring hybridization between nucleic acids having fewer mismatches. 

The degree of hybridization of an oligonucleotide to a target sequence, also 
known as hybridization strength, is determined by methods that are well-known in the art. A 

5 preferred method is to determine the T m of the hybrid duplex. This is accomplished, as 
described supra, by subjecting a duplex in solution to gradually increasing temperature and 
monitoring the denaturation of the duplex, for example, by absorbance of ultraviolet light, 
which increases with the unstacking of base pairs that accompanies denaturation. T m is 
generally defined as the temperature midpoint of the transition in ultraviolet absorbance that 

1 0 accompanies denaturation. Alternatively, if T m s are known, a hybridization temperature (at 
fixed ionic strength, pH and solvent concentration) can be chosen that it is below the T m of 
the desired duplex and above the T m of an undesired duplex. In this case, determination of 
the degree of hybridization is accomplished simply by testing for the presence of hybridized 
probe. 

15 If a probe comprises a detectable label, assays for hybridized probe are usually 

designed to detect the presence of label in duplex material. This can be accomplished, for 
example, by specifically selecting duplex material, specifically destroying single-stranded 
material, or utilizing some combination of these methods. For example, hybridization 
reaction mixtures can be subjected to high-stringency conditions and/or single strand-specific 

20 nucleases; or duplexes can be purified by affinity techniques specific for double-stranded, as 
opposed to single-stranded, nucleic acids. In a preferred embodiment of the invention, 
duplexes are detected by release of label from a probe under conditions in which label is 
released only when the probe is in a duplex. Another embodiment requires the seperation of 
the label and quenchers when hybridized to the target. 

25 Detectable labels or tags suitable for use with nucleic acid probes are well- 

known to those of skill in the art and include, but are not limited to, radioactive isotopes, 
chromophores, fluorophores, chemiluminescent and electrochemiluminescent agents, 
magnetic labels, microspheres, colloidal metal(Taton et al, Science 289:1757-1760 (2000)), 
immunologic labels, ligands and enzymatic labels. Suitable labels also include mass labels 

30 and those used in deconvolution of combinatorial chemistry libraries, for example, tags that 
can be recognized by high performance liquid chromatography (HPLC), gas chromatography, 
mass spectrometry, optical imaging fibers, surface plasmon resonance, correlation 
spectroscopy, nanotechnology (Guetence et al, J. Chromatogr. B. Biomed. ScL Appl. 
739:139-150 (2000)) and the like. 
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Methods for labeling of oligonucleotides are well-known to those of skill in 
the art and include, for example, chemical and enzymatic methods. By way of example, 
methods for incorporation of reactive chemical groups into oligonucleotides, at specific sites, 
are well-known to those of skill in the art. Oligonucleotides containing a reactive chemical 
group, located at a specific site, can be combined with a label attached to a complementary 
reactive group {e.g., an oligonucleotide containing a nucleophilic reactive group can be 
reacted with a label attached to an electrophilic reactive group) to couple a label to a probe by 
chemical techniques. Exemplary labels and methods for attachment of a label to an 
oligonucleotide are described, for example, in U.S. Patent No. 5,824,796; U.S. Patent No. 
5,210,01 5; Kessler (ed.), Nonradioactive Labeling and Detection ofBiomolecules, Springer- 
Verlag, Berlin, 1992; Kricka (ed.) Nonisotopic DNA Probe Techniques, Academic Press, San 
Diego, 1992; Howard (ed.) Methods in Nonradioactive Detection, Appleton & Lange, 
Norwalk, 1993. Non-specific chemical labeling of an oligonucleotide can be achieved by 
combining the oligonucleotide with a chemical that reacts, for example, with a particular 
fiinctional group of a nucleotide base, and simultaneously or subsequently reacting the 
oligonucleotide with a label. See, for example, Draper et al (1980) Biochemistry 19:1774- 
1781. Enzymatic incorporation of label into an oligonucleotide can be achieved by 
conducting enzymatic modification or polymerization of an oligonucleotide using labeled 
precursors, or by enzymatically adding label to an already-existing oligonucleotide. See, for 
example, U.S. Patent No. 5,449,767. Examples of modifying enzymes include, but are not 
limited to, DNA polymerases, reverse transcriptases, RNA polymerases, etc. Examples of 
enzymes which are able to add label to an already-existing oligonucleotide include, but are 
not limited to, kinases, terminal transferases, ligases, glycosylases, etc. 

If an oligonucleotide is capable of acting as a primer, the degree of 
hybridization of the oligonucleotide can also be determined by measuring the levels of the 
extension product of the primer. In the case, either the primer can be labeled, or one or more 
of the precursors for polymerization (normally nucleoside triphosphates) can be labeled. 
Extension product can be detected, for example, by size (e.g., gel electrophoresis), affinity 
methods, or any other technique known to those of skill in the art. 

Primer extension ("minisequencing", "genetic bit analysis") assays are 
commonly used for SNP typing and have the potential to be used in other genotyping and 
mutation screening applications (Pastinen T. et al., Genome Res., 10:1031-42 (2000)). In 
certain embodiments modified bases and minor groove binders improve primer extension 
assays in several ways. The added duplex stability provided by MGB, or 5-substituted 
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pyrimidine or 3-substituted pyrazolo[3,4-d]pyrimidine enables extensions to be performed at 
elevated temperatures. This is advantageous because problematic secondary structures in 
target molecules are eliminated at elevated temperatures. Also, hybridization of target to 
primer is faster at higher temperature. Thermostable polymerases such as Taq polymerase 
5 and Bst DNA polymerase may be used in such reactions. 

Furthermore, MGBs and modified bases improve the specificity of assays by 
eliminating one class of false postitive signals. Primer sequences that form hairpin structures 
or homodimers are prone to template-independent extension (the 5' end of the primer 
functions as template), resulting in false positive signal. MGBs and modified bases on 

10 "templates" inhibit extension by DNA polymerases. Thus, MGBs on the 5 ' end, or modified 
bases on the 5' end or middle of a primer, can prevent extension (false positives) from primer 
hairpins or primer dimers. Finally, PPG can be used to eliminate non-canonical structures 
formed by G-rich oligonucleotides, enabling primer extension assays in such sequences. 

Other assays in which the present modified oligonucleotides are particularly 

1 5 useful are described in co-pending application Ser. No. 09/054,832. 

Still other amplification assays in which modified oligonucleotides are useful 
include the amplification assays based on the invasive cleavage of oligonucleotide probes by 
flap endonucleases (Lyamichev et al., Nature BiotechnoL, 17:292-296 (1999) and Olson, et 
al., High-Throughput Gene Expression Monitoring with the Invader® Assay, Poster, Society 

20 for Biomolecular Screening Conference, Vancouver, British Columbia, Canada, 2000); self- 
sustained sequence replication type assays (Mueller et al, Histochem. Cell Biol, 108:431-437 
(1997)) and the like. Surprisingly, non-natural bases can be substituted in both the invader 
and genomic probes of a cleavase-based assay. These modifications include but are not 
limited to pyrazolo[3,4-d]pyrimidines, 3-substituted pyrazolo[3,4-d]pyrimidines and 5- 

25 substituted pyrimidines. Non-natural backbones are also included such as monomers used in 
peptide nucleic acids, locked nucleic acids etc. Still further, the modified oligonucleotides 
can have attached minor groove binders, fluorophores, quenchers, and the like. One of skill 
in the art will appreciate that chimeras can also be used to allow optimal enzyme activity and 
performance. 

30 In the present invention, marked improvements are unexpectedly shown in 

cleavase-based assays, when modified bases were substituted for normal bases in both the 
invader and genomic probes. Thus, the use of the modified bases such as 
hydoxypropynylPPA (HOPPPA), (NH 2 ) 2 PPPAOH and 3-iododiaminoPPA ((NH 2 ) 2 PPAI,) 
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allowed one to decrease the length of both the invader and genomic probes, yet provide 
improved assay performance. In another embodiment non-natural bases are also incorporated 
in the cassette probe. 

In view of the above, the present invention provides in one aspect, a method 
5 for distinguishing polynucleotides with related sequences, the method comprising: 

(a) contacting a modified oligonucleotide having a defined sequence 
comprising at least one 3-substituted pyrazolo[3,4-rf]pyrimidine or 5-substituted pyrimidine 
in place of a purine or pyrimidine base with at least two polynucleotides, wherein one of the 
polynucleotides has a target sequence that is perfectly complementary to the modified 

10 oligonucleotide and at least one of the other polynucleotides has a target sequence with at 
least one base mismatch; and 

(b) determining the degree of hybridization between the modified 
oligonucleotide and each of the polynucleotides. 

Preferably, at least one of the other polynucleotides has a target sequence with 
1 5 one or two base mismatches, more preferably only one base mismatch. 

As noted above, a target sequence refers to a nucleotide sequence which 
comprises a site of hybridization for a probe or a primer. Target sequences can be found in 
any nucleic acid including, but not limited to, genomic DNA, cDNA, RNA and any amplified 
product thereof, and can comprise a wild-type gene sequence, a mutant gene sequence, a non- 
20 coding sequence, a regulatory sequence, etc. A target sequence will generally be less than 
1 00 nucleotides, preferably less than 50 nucleotides, and most preferably, less than 2 1 
nucleotides in length. 

The modified oligonucleotides used in this aspect of the invention are 
essentially modified probes and the polynucleotides can be distinguished by determining 
25 which polynucleotides hybridizes to the modified probe. The modified probes can be labeled 
with any detectable label, or the probe can have the capacity to become labeled either before 
or after hybridization, such as by containing a reactive group capable of association with a 
label or by being capable of hybridizing to a secondary labeled probe, either before or after 
hybridization to the target. Conditions for hybridization of nucleic acid probes are well- 
30 known to those of skill in the art. See, for example, Sambrook et al, supra; Ausubel et al., 
supra; Innis et al, supra; Hames et al. supra; and van Ness et al, supra. 

Hybridization can be assayed {i.e., hybridized nucleic acids can be identified) 
by distinguishing hybridized probe from free probe by one of several methods that are well- 
known to those of skill in the art. These include, but are not limited to, attachment of target 
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nucleic acid to a solid support, either directly or indirectly (by hybridization to a second, 
support-bound probe or interaction between surface-bound and probe-conjugated ligands) 
followed by direct or indirect hybridization with probe, and washing to remove unhybridized 
probe; determination of nuclease resistance; buoyant density determination; affinity methods 

5 specific for nucleic acid duplexes (e.g., hydroxyapatite chromatography); interactions 
between multiple probes hybridized to the same target nucleic acid; etc. See, for example, 
Falkow et al, U.S. Patent No. 4,358,535; Urdea et al, U.S. Patent Nos. 4,868,105 and 
5,124,246; Freifelder, Physical Biochemistry, Second Edition, W. H. Freeman & Co., San 
Francisco, 1982; Sambrook, et al, supra; Ausubel et al., supra; Hames et al, supra; and 

1 0 other related references. The duplex-stabilizing capability of MGB-modified oligonucleotide 
conjugates makes hybridization possible under more stringent conditions, wherein potentially 
occluding secondary structure in the target nucleic acid can be minimized. Accordingly, such 
MGB-modified oligonucleotides are particularly preferred in this aspect of the invention. 

In a related aspect, the present invention provides a method for detecting the 

1 5 presence of a target sequence in a polynucleotide, the method comprising: 

(a) incubating a polynucleotide to be tested for the presence of the target 
sequence with a modified oligonucleotide having a sequence that is substantially 
complementary to the target sequence under hybridization conditions; and 

(b) identifying hybridized nucleic acids; 

20 wherein the modified oligonucleotide comprises at least one 3-substituted 

pyrazolo[3,4-^pyrimidine in place of a purine residue. 

Preferably, the modified oligonucleotide is a labeled probe and has at least two 
3-substituted pyrazolo[3,4-d]pyrimidine bases. In this group of embodiments, a labeled 
probe is hybridized to a target and/or an extension product of a target, and a change in the 

25 physical state of the label is effected as a consequence of hybridization. As used herein, a 
"probe" is a nucleic acid molecule that is capable of hybridizing to a target sequence in a 
second nucleic acid molecule. By way of example, one assay of this type, the hydrolyzable 
probe assay, takes advantage of the fact that many polymerizing enzymes, such as DNA 
polymerases, possess intrinsic 5'-3' exonucleolytic activities. Accordingly, if a probe is 

30 hybridized to a sequence that can serve as a template for polymerization (for instance, if a 
probe is hybridized to a region of DNA located between two amplification primers, during 
the course of an amplification reaction), a polymerizing enzyme that has initiated 
polymerization at an upstream amplification primer is capable of exonucleolytically digesting 
the probe. Any label attached to such a probe will be released, if the probe is hybridized to 
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its target and if amplification is occurring across the region to which the probe is hybridized. 
Released label is separated from labeled probe and detected by methods well-known to those 
of skill in the art, depending on the nature of the label. For example, radioactively labeled 
fragments can be separated by thin-layer chromatography and detected by autoradiography; 
5 while fluorescently-labeled fragments can be detected by irradiation at the appropriate 

excitation wavelengths with observation at the appropriate emission wavelengths. See, e.g., 
U.S. Patent No. 5,210,015. 

In a variation of this technique, a probe contains both a fluorescent label and a 
quenching agent, which quenches the fluorescence emission of the fluorescent label. In this 

10 case, the fluorescent label is not detectable until its spatial relationship to the quenching agent 
has been altered, for example by exonucleolytic release of the fluorescent label from the 
probe. Thus, prior to hybridization to its target sequence, the dual fluorophore/quencher 
labeled probe does not emit fluorescence. Subsequent to hybridization of the 
fluorophore/quencher-labeled probe to its target, it becomes a substrate for the exonucleolytic 

1 5 activity of a polymerizing enzyme which has initiated polymerization at an upstream primer. 
Exonucleolytic degradation of the probe releases the fluorescent label from the probe, and 
hence from the vicinity of the quenching agent, allowing detection of a fluorescent signal 
upon irradiation at the appropriate excitation wavelengths. This method has the advantage 
that released label does not have to be separated from intact probe. Multiplex approaches 

20 utilize multiple probes, each of which is complementary to a different target sequence and 
carries a distinguishable label, allowing the assay of several target sequences simultaneously. 

The use of MGB-modified oligonucleotide conjugates in this and related 
methods allows greater speed, sensitivity and discriminatory power to be applied to these 
assays. In particular, the enhanced ability of MGB-modified oligonucleotide conjugates to 

25 allow discrimination between a perfect hybrid and a hybrid containing a single-base 

mismatch will facilitate the use of hydrolyzable probe assays in the identification of single- 
nucleotide polymorphisms and the like. One of skill in the art will appreciate that 
compositions and methods, such as those of the invention, that are capable of discriminating 
single-nucleotide mismatches will also be capable of discriminating between sequences that 

30 have 2, 3, 4, 5, or even 6 or more mismatches with respect to one another. 

In yet another related aspect, the present invention provides a method for 
primer extension, the method comprising incubating a polynucleotide containing a target 
sequence with one or more oligonucleotide primers complementary to the target sequence, in 
the presence of a polymerizing enzyme and nucleotide substrates under conditions favorable 
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for polymerization; wherein at least one of the oligonucleotide primers contains a 3- 
substituted pyrazolo[3,4-^pyrimidine or a 5-substituted pyrimidine base in place of a purine 
orpyrimidinebase. 

Amphfication procedures are those in which many copies of a target nucleic 

5 acid sequence are generated, usually in an exponential fashion, by sequential polymerization 
and/or ligation reactions. In addition to the more traditional amplification reactions discussed 
below, the present invention is useful in amplifications involving three-way junctures (see, 
WO 99/37085), signal amphfication (see Capaldi, et al., Nuc, Acids Res,, 28:E21 (2000)), T7 
polymerases, reverse transcriptase, RNase H, RT-PCR, Rolling Circles, cleavase and the like. 

10 Many amphfication reactions, such as PCR, utilize reiterative primer- 

dependent polymerization reactions. A primer is a nucleic acid that is capable of hybridizing 
to a second, template nucleic acid and that, once hybridized, is capable of being extended by 
a polymerizing enzyme (in the presence of nucleotide substrates), using the second nucleic 
acid as a template. Polymerizing enzymes include, but are not limited to, DNA and RNA 

15 polymerases and reverse transcriptases, etc. Conditions favorable for polymerization by 

different polymerizing enzymes are well-known to those of skill in the art. See, for example, 
Sambrook et al, supra; Ausubel, et al, supra; Innis et al, supra. Generally, in order to be 
extendible by a polymerizing enzyme, a primer must have an unblocked 3'-end, preferably a 
free 3* hydroxyl group. The product of an amplification reaction is an extended primer, 

20 wherein the primer has been extended by a polymerizing enzyme. 

Thus, in one embodiment of the invention, the methods and compositions 
disclosed and claimed herein are useful in improved amplification reactions such as PCR. 
See, e.g., U.S. Patents 4,683,202; 4,683,195 and 4,800,159; Mullis and Faloona, supra; and 
Saiki et al, supra. The polymerization step of PCR is most often catalyzed by a thermostable 

25 polymerizing enzyme, such as a DNA polymerase isolated from a thermophilic bacterium, 
because of the elevated temperatures required for the denaturation step of PCR. As discussed 
supra, one of the problems heretofore associated with the practice of PCR is the requirement 
for relatively long oligonucleotide primers, having sufficient hybrid stability to serve as 
primers at the elevated temperatures under which PCR is conducted. Modified 

30 oligonucleotides and especially MGB-modified oligonucleotide conjugates are useful as 
primers in amplification reactions such as PCR, as the modified bases and MGBs increase 
hybrid stability, thereby significantly extending the lower limit of useful primer length. In 
addition, MGB-modified oligonucleotide conjugates are useful in specialized PCR protocols 
wherein reduced primer length is desirable. These include, but are not limited to, differential 
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display, in which optimal primer length is below 10 nucleotides, random amplification of 
polymorphism in DNA (RAPD) techniques, and amplification length polymorphism 
analyses. Liang et al, supra; Williams et aL, supra. 

The modified oligonucleotides of the present invention are applicable to any 
5 type of assay or procedure in which PCR or a related amplification technique is used, 
including, but not limited to, hydrolyzable probe assays, priming with allele-specific 
oligonucleotides (ASOs), fragment length polymorphism analysis, single nucleotide 
polymorphism (SNP) analysis and microsatellite analysis, for example. These and other 
techniques are useful in gene mapping, in the identification and screening of disease-related 
1 0 genes, and in pharmacogenetics, to name just a few applications. 

In still another related aspect, the present invention provides a method for 
determining the nucleotide sequence of a polynucleotide, the method comprising: 

(a) incubating the polynucleotide with a modified oligonucleotide array under 
hybridization conditions; and 
1 5 (b) determining to which of the modified oligonucleotides in the array the 

polynucleotide hybridizes; 

wherein the modified oligonucleotides comprise at least one 3-substituted 
pyrazolo[3,4-d]pyrimidine in place of a purine base. 

In these procedures, an ordered array comprising a plurality of modified 
20 oligonucleotides of different known sequences is used as a platform for hybridization to one 
or more test polynucleotides, nucleic acids or nucleic acid populations. Determination of the 
oligonucleotides which are hybridized and alignment of their known sequences allows 
reconstruction of the sequence of the test polynucleotide. See, for example, U.S. Patent Nos. 
5,492,806; 5,525,464; 5,556,752; and PCT Publications WO 92/10588 and WO 96/17957. 
25 Materials for construction of arrays include, but are not limited to, nitrocellulose, glass, 
silicon wafers, optical fibers and other materials suitable for construction of arrays such as 
are known to those of skill in the art. 

A major problem with current array-based sequencing and analysis methods is 
that the different oligonucleotides in an array will each have a different T m . Hence, it is 
30 difficult to determine the stringency conditions that will provide maximum sensitivity, while 
retaining the ability to distinguish single-base mismatches. This is a particularly important 
consideration for most, if not all, applications of array technology. Use of modified 
oligonucleotides and/or MGB-modified oligonucleotide conjugates in array-based sequencing 
and analysis techniques provides a solution to this problem. Surprisingly, conjugation of a 
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MGB to a modified oligonucleotide makes its T m relatively independent of base composition. 
Thus, for a population of modified oligonucleotides and MGB-modified oligonucleotide 
conjugates of a given length, the T m for a perfect hybrid falls within a relatively narrow 
temperature range regardless of sequence. At the same time, the T m for a single nucleotide 
mismatch is well below the T m of the perfect match. Thus, arrays designed such that all 
modified oligonucleotides are the same length and are optionally present as their MGB 
conjugates exhibit minimal variation in T m among the different oligonucleotides in the array, 
enabling more uniform hybridization conditions for the entire array. A further advantage to 
the use of modified oligonucleotides and MGB-modified oligonucleotide conjugates in these 
techniques is that it provides greater sensitivity, by allowing the use of shorter 
oligonucleotides, at higher temperatures (and hence higher stringency), while retaining 
single-nucleotide resolution. 

An additional application of the present invention to array technology is in the 
examination of patterns of gene expression in a particular cell or tissue (see, generally, Eisen, 
et al, Methods in Enzymology, 303:179-205 (1999)). In this case, modified 
oligonucleotides or polynucleotides corresponding to different genes are arrayed on a surface, 
and a nucleic acid sample from a particular cell or tissue type, for example, is incubated with 
the array under hybridization conditions. Detection of the sites on the array at which 
hybridization occurs allows one to determine which modified oligonucleotides have 
hybridized, and hence which genes are active in the particular cell or tissue from which the 
sample was derived. 

Array methods can also be used for identification of mutations or 
polymorphisms, where wild-type and mutant sequences are placed in an ordered array on a 
surface (see, Hacia, et al., J. Mol Genet, 36:730-736 (1999)). Hybridization of a 
polynucleotide sample to the array under stringent conditions, and determination of which 
oligonucleotides in the array hybridize to the polynucleotide, allows determination of whether 
the polynucleotide possesses the wild-type or the mutant sequence. The increased 
discriminatory abilities of MGB-oligonucleotide conjugates are especially useful in this 
application of array technology. 

Accordingly, the present invention provides a method for identifying a 
mutation in a target sequence of a gene of interest, the method comprising: 

(a) incubating a polynucleotide comprising the target sequence with an array 
of oligonucleotides of different sequences, wherein the different sequences include the wild- 
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type target sequence and different mutant target sequences, under hybridization conditions; 
and 

(b) determining which of the oligonucleotides in the array become hybridized 
to the polynucleotide; 

5 wherein one or more purine residues in a plurality of the oligonucleotides are 

replaced with a 3-substituted pyrazolo[3,4-<f]pyrimidine. 

In yet another related aspect, the present invention provides a method 
determining the nucleotide sequence of a target sequence in a polynucleotide, the method 
comprising: 

10 (a) contacting a polynucleotide comprising the target sequence with at least 

two oligonucleotides of known sequence wherein one or more purine residues of the 
oligonucleotides are replaced by a 3-substituted pyrazolo[3,4-^pyrimidine, and wherein one 
of the at least two oligonucleotides has a sequence that is perfectly complementary to the 
target sequence and at least one other of the oligonucleotides has a related target sequence 

1 5 and incubating each of the oligonucleotides with the polynucleotide under hybridization 
conditions; and 

(b) determining the degree of hybridization between each of the 
oligonucleotides and the polynucleotide. 

In one embodiment, a collection of all possible n-mer oligonucleotides (where 

20 n is an integer less than about 10) are used in a hydrolyzable probe assay to determine a 

nucleotide sequence. Each oligonucleotide is uniquely labeled (and preferably modified) and 
analysis of released label indicates which of the oligonucleotides has hybridized to the target 
sequence. Alignment of the sequences of the oligonucleotides which have hybridized 
provides the nucleotide sequence. 

25 Modified oligonucleotides, and more preferably modified oligonucleotide- 

MGB conjugates are also useful in primer-dependent methods of DNA sequencing, such as 
the chain-termination method and its derivatives, originally described by Sanger et al, supra. 
Use of MGB-modified oligonucleotide conjugates in chain-termination sequencing allows the 
use of shorter primers at higher stringency, and enables a greater degree of mismatch 

30 discrimination during sequencing. Examples include, but are not limited to, a search for 

genes sharing a short region of homology (on the order of a few amino acids) and sequencing 
in a region in which very little existing sequence information is available. MGB- 
oligonucleotide conjugates are useful in such short primer sequencing techniques. 
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In still another embodiment, the present invention provides a method for 
examining gene expression in a cell, the method comprising: 

(a) incubating a population of polynucleotides representative of the genes 
expressed in the cell with an oligonucleotide array comprising a plurality of modified 

5 oligonucleotides of different sequences under hybridization conditions, and 

(b) determining which of the modified oligonucleotides in the array become 
hybridized to polynucleotides; 

wherein said modified oligonucleotides comprise at least one 3 -substituted 
pyrazolo[3,4-<flpyrimidine in place of a purine. 

1 0 In one group of embodiments, the method is carried out with a plurality of 

expressed genes from multiple cell-types or tissues. The genes are preferably labeled with 
different detection labels, then hybridized to an array at the same time and monitored via the 
label to determine each gene's expression pattern. 

In addition to the assays and diagnostic methods described above, the 

1 5 modified oligonucleotides and modified bases described herein will have utility in antisense 
technologies. Antisense oligonucleotides are known to selectively inhibit gene expression 
and provide a genetic approach for disease treatment and prevention (Smith et al, Int. J. 
Oncol, 17:841-850 (2000)). Other modified oligonucleotides have been used to improve the 
performance of antisense oligonucleotides ( Zhang et al, Nat BiotechnoL, 18:862-867 

20 (2000); Flanagan et al, Nat BiotechnoL, 14:1 139-1 145 (1996)). Accordingly, another 

embodiment of the present invention is the use of the 5-substituted pyrimidines, unsubstituted 
pyrazolo[3,4-d]pyrimidines and 3-substituted pyrazolo[3,4-d]pyrimidine as monomers, either 
alone or in any combination, in the synthesis of antisense oligomers. In another embodiment 
the modified bases described herein can be used as monomers in an oligonucleotide to 

25 decrease enzymatic degradation of the antisense oligomers. 

New Modified Bases 

In another aspect, the present invention provides a number of new modified 
30 bases. These bases have the general formula: 

Y 2 — I B 

IF 1 ? 

Y 1 Z 1 
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wherein Z 1 is a member selected from the group consisting of H, F and OR a wherein R a is a 
member selected from the group consisting of H, (Ci-C 8 )alkyl and a hydroxy protecting 
group; Z 2 is a member selected from the group of H, (Ci-C 8 )alkyl, or is optionally combined 
with Z l to form a five- to seven-membered ring, having from one to three heteroatoms 

5 selected from the group consisting of O, S and N; Y 1 is a member selected from the group 
consisting of OH, a protected hydroxy group and O-P 1 , wherein P l is aphosphoramidite or 
H-phosphonate group; Y 2 is a member selected from the group consisting of OH, a protected 
hydroxy group and O-P 2 , wherein P 2 is a phosphoramidite, H-phosphonate, monophosphate, 
diphosphate or triphosphate; and B is a modified nucleotide selected from the group 

10 consisting of: 



wherein X 1 1 and X 12 are each independently selected from the group consiting of H, NH 2 and 
a protected amino group; each R 11 is independently selected from the group consisting of 
protected or unprotected forms of 3-hydroxyprop-l-ynyl, 3-aminoprop-l-ynyl, 3- 

1 5 methoxyprop-l-ynyl, 4-hydroxy-l-butynyl, 4-amino-l-butynyl and 3-(hydroxymethyl)-4- 
hydroxy-l-butynyl; and each R 12 is independently selected from the group consisting of 
heterocyclyl(C|-Ct 2 )alkyl, heterocyclyl(C 2 -Ci 2 )alkenyl, heterocyclyl(C 2 -Ci 2 )alkynyl, 
heterocyclyl, 3-hydroxyprop-l-ynyl, 3-aminoprop-l-ynyl, 3-methoxyprop-l-ynyl, 4-hydroxy- 
l-butynyl, 4-amino-l-butynyl and 3-(hydroxymethyl)-4-hydroxy-l-butynyl, with the proviso 

20 that R 12 is other than 2-pyridylethynyl. 



Preferably, X 11 is NH 2 and X 12 is H. More preferably, X 11 is NH 2 , X 12 is H, Y 1 is O-P 1 , Y 2 is 
a protected hydroxy, Z 1 is H, and R 12 is selected from 3-hydroxyprop-l-ynyl, 3-aminoprop-l- 
25 ynyl, 4-hydroxy-l-butynyl, 4-amino-l-butynyl, 3-(hydroxymethyl)-4-hydroxy-l-butynyl, 3- 
methoxyprop-l-ynyl, 2-furanyl, 3-furanyl, 2-thienyl, 3-thienyl, 2-pyrrolyl, 3-pyrrolyl, 1,3- 
isoxazol-4-yl, l-,3-isoxazol-5-yl, l,3-isoxazol-2-yl, 2-imidazolyl, 4-imidazolyl, 2-pyridyl, 3- 




In one group of embodiments, B is 
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pyridyl and 4-pyridyl. In particularly preferred embodiments, Y 1 is -0-[(2-cyanoethyl) N,N- 
diisopropylphosphoramidite] and Y 2 is -0-(4,4'-dimethoxytrityl). 
In another group of embodiments, B is 




WW 



Preferably, X 12 is NH 2 or H. More preferably, X 12 is NH 2 or H, Y 1 is O-P 1 , Y 2 is a protected 
hydroxy, Z 1 is H, and R 12 is selected from 3-hydroxyprop-l-ynyl, 3-aminoprop-l-ynyl, 3- 
(hydroxymethyl)-4-hydroxy-l-butynyl, 3-methoxyprop-l-ynyl, 2-furanyl, 3-furanyl, 2- 
thienyl, 3-thienyl, 2-pyrrolyl, 3-pyrrolyl, l,3-isoxazol-4-yl, l,3-isoxazol-5-yl, l,3-isoxazol-2- 
yl, 2-imidazolyl, 4-imidazolyl, 2-pyridyl, 3-pyridyl and 4-pyridyL In particularly preferred 
embodiments, Y 1 is -0-[(2-cyanoethyl) N,N-diisopropylphosphoramidite] and Y 2 is - 
0-(4,4'-dimethoxytrityl). 

In still other preferred embodiments, the modified oligonucleotides of the 
invention have at least one base of formula HI (including those in which the acetyl protecting 
groups are removed, and including other protected forms and activated forms thereof). 




OH 

m 

Preparation of compounds of formula III (including compounds wherein the 6- 
amino group is protected and the hydroxy groups of the sugar are either protected or activated 
as a phosphoramidite) are provided in Reaction Scheme 6, above. 

In the most preferred embodiments of the invention, the modified bases are 
selected from formulae IVa, IVb or IVc. 
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Me 2 N 



^ M x 



<r N < 

IVa 




DMTr0^y°J 

O f y 



IVb 




IVc 

These compounds are particularly well-suited for use in automatic 
oligonucleotide synthesizers, and in preparing certain modified oligonucleotides described 
herein. 

Other modified bases that are useful in the present invention include those 
represented by formulae Va and Vb: 




f\-K X X < N 



DMTrO' 



Va 



\ ' < 
Vb 
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wherein R 7 is selected from OH, SH or NH 2 . These compounds can be synthesized by 
methods described herein. 

In another group of embodiments, B is 




5 Preferably, X 1 1 is NH 2 . More preferably, X n is NH 2 , Y 1 is O-P 1 , Y 2 is a protected hydroxy, 
Z 1 is H, and R n is selected from 3-hydroxyprop-l-ynyl, 3-aminoprop-l-yayl, 4-hydroxy-l- 
butynyl, 4-amino-l-butynyl, 3-(hydroxymethyl)-4-hydroxy-l-butynyl, and 3-methoxyprop-l- 
ynyl. In particularly preferred embodiments, Y 1 is -0-[(2-cyanoethyl) N,N- 
diisopropylphosphoramidite] and Y 2 is -0-(4,4'-dimethoxytrityl). 

10 In another group of embodiments, B is 



O 




Preferably, Y 1 is O-P 1 , Y 2 is a protected hydroxy, Z 1 is H, and R 11 is selected from 3- 
hydroxyprop-l-ynyl, 3-aminoprop-l-ynyl, 4-hydroxy- 1 -butynyl, 4-amino-l-butynyl, 3- 
(hydroxymethyl)-4-hydroxy-l-butynyl, and 3-methoxyprop-l-ynyl. In particularly preferred 
15 embodiments, Y 1 is -0-[(2-cyanoethyl) N,N-diisopropylphosphoramidite] and Y 2 is - 
0-(4,4'-dimethoxytrityl). 

In a related aspect, the present invention provides modified oligonucleotides 

having the formula: 

Z 3 Z 1 
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wherein each Z 1 is independently selected from H, F and 0R a wherein R a is a member 
selected from H, (Ci-C 8 )alkyl and a hydroxy protecting group; each Z 2 is H or (Ci-C 8 )alkyl, 
or is optionally combined with Z l to form a five- to seven-membered ring; each Z 3 is selected 
from O, S or NH; each Y is independently selected from P(0)OH, P(S)OH and P(0)CH 3 ; the 
subscript n is an integer of from 1 to 98; W l and W 2 are each independently selected from H, 
a monophosphate, a diphosphate, a triphosphate and a minor groove binder-linking group 
moiety having an optionally attached reporter group or quencher; and each B is 
independently selected from the group consisting of adenine, guanine, cytosine, uridine and 
modified bases of the formula: 




wherein X 1 1 and X 12 are each independently selected from H, NH2 and a protected amino 
group; each R 11 is independently selected from protected or unprotected forms of 3- 
hydroxyprop-l-ynyl, 3-aminoprop-l-ynyl, 3-methoxyprop-l-ynyl, 4-hydroxy-l-butynyl, 4- 
amino-l-butynyl, and 3-(hydroxymethyl)-4-hydroxy-l-butynyl; and eachR 12 is 
independently selected from the group consisting of protected or unprotected forms of 3- 
hydroxyprop-l-ynyl, 3-aminoprop-l-ynyl, 3-methoxyprop-l-ynyl, 4-hydroxy-l-butynyl, 4- 
amino-l-butynyl, 3-(hydroxymethyl)-4-hydroxy-l-butynyl, heterocyclyl(Ci-Ci 2 )alkyl, 
heterocyclyl(C 2 -Ci2)alkenyl, heterocyclyl(C 2 -Ci 2 )alkynyl and heterocyclyl, with the provsio 
that R 12 is other than 2-pyridylethynyl; and 

with the further proviso that at least one of the Bs is selected from the 
modified bases, and optionally, one or more of the Bs has an attached minor groove binder- 
linking group moiety, reporter group or a combination thereof. 

Particularly preferred modified bases are those that have been described 

above. 

EXAMPLES 

In the hydrolyzable probe assay, a labeled probe is added to a PCR reaction. 
The probe is complementary to a region between the two PCR primers and is labeled with 
two fluorophores, one of which quenches the fluorescence of the other. The probe is 
designed to hybridize to its complementary target sequence on one of the PCR product 
strands at or above the strand extension temperature typically used in PCR (55-75°C). The 
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polymerizing enzymes normally used in PCR (Taq polymerase in particular) possess an 
intrinsic S'-exonuclease activity. During synthesis of new strands in the extension stage of 
the PCR reaction, this 5'-exonuclease activity will act on complementary strands bound to the 
template. If a probe, labeled as described above, is bound to the template, the 5'-exonuclease 
activity associated with the polymerizing enzyme will liberate the bound fluorophore. Once 
liberated, its fluorescence will no longer be quenched, and a fluorescent signal will be 
obtained. See, for example, U.S. Patent No. 5,210,015; Livak et al (1995) PCRMeth. App. 
4:357-362; and Heid et al. (1996) Genome Res. 6:986-994. 

Thin-layer chromatography was run on silica gel 60 F-254 (EM Reagents) 
aluminum-backed plates. l H NMR were obtained at 300 MHz on a Varian VXR-300 
spectrometer. Two-dimensional (Cosy) and NOE experiments assisted in the assignment of 
proton resonances. Elemental analyses were performed by Quantitative Technologies Inc. 
(Boundbrook, NJ). 

Example 1 

This example illustrates the synthesis of 5-(Prop-2-ynyl-4-methylbenzoate)-5'- 
0-(4,4'-dimethoxytriphenylmethyl)-2'-deoxyuridine 3 '-[(2-cyanoethyl) N,N- 
diisopropylphosphoramidite] ( 3). 
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5-(Prop-2-ynyl-4-methylbenzoate)-2 '-deoxyuridine (1). 

To a mixture of 5-iodo-2'-deoxyuridine (5.0 g, 14.12 mmol), Cul (270 mg, 
1 .42 mmol), Pd(PPh 3 ) 4 (0.82 g, 0.714 mmol), and triethylamine (2.4 mL) in 30 mL of 

5 anhydrous DMF was added prop-2-ynyl 4-methylbenzoate (6.10 g, 35.06 mmol). The 
mixture was stirred under argon for 4 h. and then evaporated to dryness. The residue was 
triturated in methanol and the excess prop-2-ynyl 4-methylbenzoate that precipitated was 
filtered away. The filtrate was evaporated and the residue was purified by silica gel 
chromatography eluting with 10% methanol in ethyl acetate. The pure product fractions were 
10 evaporated to dryness and the residue was precipitated from ethyl aceate- ether: 3.14 g (56%) 
yield; TLC (10% methanol in ethyl acetate), R f = 0.50; ! H NMR (DMSO-d 6 ) 8 1 1 .68 (1H, s, 
uracil N-H), 8.29 (1H, s, 6-H), 7.89 and 7.35 (4H, 2 x d, J = 8.2 Hz, toluoyl Hs), 6.10 (1H, t, J 
= 6.4 Hz, l'-H), 5.24 (1H, d, J - 4.4 Hz, 3'-OH), 5.15 (2H, s, alkynyl-CH 2 ), 5.12 (1H, t, J = 
5.2 Hz, 5'-OH), 4.24 (1H, m, 3'-H), 3.79 (1H, q, 4'-H), 3.59 (2H, m, 5'-Hs), 2.39 (3H, s, 

15 toluoyl-CH 3 ),2.13(2H,m,2'-H). 

5-(Prop-2-ynyl-4-methylbenzoate)-5 '-0-(4,4 '-dimethoxytriphenylmethyl)-2 '-deoxyuridine 
(2). 

To a solution of 1 (3.0 g, 7.50 mmol) in 45 mL of anhydrous pyridine was 
20 added dimethoxytrityl chloride (3 .0 g). The resulting solution was stirred for 4 h. at room 
temperature and then poured into 400 mL of 5% sodium bicarbonate solution. The mixture 
was extracted with ethyl acetate (2 x 300 mL) and the extract was dried over sodium sulfate, 
filtered and evaporated. The residue was purified by silica gel chromatography eluting with 
5% methanol in the ethyl acetate. The pure product fractions were pooled and evaporated 
25 affording 2 as a foam: 4.16 g (79%) yield 

5-(Prop-2-ynyl-4-methylbenzoate)-5 '-0-(4,4 '-dimethoxytriphenylmethyl)-2 '-deoxyuridine 3 - 
[(2-cyanoethyl) N,N-diisopropylphosphoramidite] (3). 

To a solution of 2 (4.0 g, 5.70 mmol) in 130 mL of anhydrous methylene 
30 chloride, containing 3.0 mL of M//-diisopropylethylamine was added chloro(2- 

cyanoethoxy)(iV,iV-diisopropylammo)phosphine (2.22 mL) under argon. The solution was 
stirred for 30 min at room temperature and then treated with 3.0 mL of methanol. The 
solution was diluted with 300 mL of ethyl acetate and washed with 300 mL of 5% sodium 
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bicarbonate solution. The aqueous phase was extracted with 300 mL of ethyl acetate and the 
combined organic phases were dried over sodium sulfate, filtered and evaporated. The 
residue was purified by silica gel chromatography eluting with a gradient of 30 to 0% hexane 
in ethyl acetate (2% triethylamine). The pure fractions were pooled and evaporated and the 
residue was precipitated from ethyl acetate - hexanes: 3.35 g (65%) yield; TLC (20% hexane 
in ethyl acetate), R f = 0.82 and 0.71 (diasteromers); 3, P NMR (DMSO-d*) 5 147.82 and 
147.45. 

Example 2 

Phase 1 Preparation of (2RMh5-(4-amino-34odopyrazolo[3A-d]pynmi^^^ 
(hydroxymethyl)oxolan-3-ol (9). 

3-Iodo~l,5-dihydropyrazolo[3, 4-d]pyrimidin-4-one (4). 

The synthesis of this compound was previously reported by Taylor et. al 
{Tetrahedron, 48(37):8089-8100 (1992)) using a N-iodosuccinimide. Our synthesis, 
employing iodine momochloride as the iodinating agent is described below. 

To a 1.0 L solution of 1.2 M sodium acetate was added 4- 
hydroxypyrazolo[3,4-d]pyrimidine (25 g, 184 mmol) followed by iodine monochloride (46 g, 
284 mmol). The mixture was stirred in a sealed container for 4 h. at 1 10 °C. The completed 
reaction was cooled to room temperature and treated with a solution of 30 g of sodium 
metabisulfite in 200 mL of water. The white precipitate that formed was filtered and rinsed 
with cold water. The solid was then dissolved in 200 mL of 3.2 M potassium hydroxide 
solution. Hydrazine monohydrate (5 mL) was added and the solution was stirred for 15 min. 
The solution was then acidified to ca. pH 7 by addition of 50 mL of concentrated HC1 
followed by a fine pH adjustment with acetic acid. The solid that formed was filtered, rinsed 
with cold water and dried: 55.1 g (1 14% yield). Note- it is difficult to remove residual water 
by drying the solid under vacuum. The residual water will be removed by a pyridine 
evaporation process in the next step. 

3-Iodo-l, 5-dihydropyrazolo[3, 4-dJpyrimidin-4-thione (5). 

Compound 4 (21.5 g, 82.1 mmol) was suspended in 150 mL of anhydrous 
pyridine and the mixture was evaporated to dryness. The solid was re-suspended in 170 mL 
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of dry pyridine (under argon) and P 2 S 5 (26.8 g) was added. The mixture was stirred for 10 
min at 65 °C and then for 2-6 hours at 95 °C. The resulting solution was cooled to room 
temperature and degassed by bubbling a stream of argon into the solution (sulfide gas is 
passed into a trap containing sodium hydroxide solution). The reaction solution was then 
5 reduced in volume until a thick syrup formed. Excess P 2 S 5 was decomposed by addition of 1 
mL aliquots of water, until the vigorous reaction ceased, followed by addition of 500 mL of 
water and 10 mL of acetic acid. The mixture was warmed to 70 °C for 1 h to expedite 
expulsion of hydrogen sulfide gas and then diluted with 500 mL of water and cooled in an 
ice-bath. The solid was filtered, washed with water and dryed: 19.8 g (87%) yield. 

10 

4-Ethylthio-3-iodopyrazolo[3, 4-dJpyrimidine (6). 

Compound 5 (43.5 g, 157 mmol) was stirred in potassium hydroxide solution 
(38.6 g KOH in 350 mL of water) for 30 min. The mixture was filtered and the filtrate was 
acidified to pH 10 by addition of acetic acid and then diluted with 350 mL of absolute 

15 ethanol. Iodoethane (10 mL) was added and the solution was stirred at room temperature. 
Additional 10 mL aliquots of iodoethane were added after 30 min. and 1.0 h. The reaction 
was complete after a total of 90 min. Other runs required further addition of iodoethane to 
complete the reaction. The reaction solution was diluted with 700 mL of water and 20 mL of 
acetic acid. The mixture was cooled in an ice-bath and the solid was filtered, rinsed with 

20 water and dried. This crude product was dissolved in 300 of DMF and stirred at 90 °C for 1 5 
min. Insoluble material was filtered away and the filtrate was diluted with 1 L of water and 
cooled in an ice-water bath. The solid was filtered, rinsed with water and dried: 1 9 g (40%) 
yield. 

25 [(2R t 5-R)-5-(4-Ethylthio-3-iodopyrazolo[3J-d]p 

methylphenylcarbonyloxy)oxolan-2-yl]methyl 4-methylbenzonate (7). 

Compound 6 was converted to the corresponding potassium salt by reaction 
with 1 molar equivalent of potassium hydroxide in water. The resulting solution was 
evaporated to dryness and the residue was evaporated from dry acetonitrile. The potassium 

30 salt of 6 (29.0 g, 94.73 mmol) was dissolved in 80 mL of anhydrous DMF and then diluted 
with 830 mL of anhydrous acetonitrile. The chlorosugar derivative (48 g, 123 mmol) was 
added in one portion and the mixture was stirred for 3 h and then diluted with 1.5 L of 25% 
aqueous methanol. The mixture was allow to sit at 5 °C overnight. The crystals were 
filtered, rinsed with 25% aqueous methanol and dried: 32.2 g (52%) yield. 
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(2RM)-2-(hydroxymethyl)-5-(3-iodo^-methylpyrazolo[3,4-d]pyri 

To a suspension of 7 (10.97 g, 16.66 ramol) in 250 mL of methanol was added 
22 mL of IN sodium methoxide in methanol. The mixture was stirred at reflux and the 
reaction progress was monitored as soon as the mixture turned to a clear solution Note- 
formation of side products are observed if refluxing is continued after complete conversion to 
5. The reaction was quenched by addition of 1 .34 mL of acetic acid as soon as one 
component, corresponding to the desired product, is observed by TLC. The solution was 
evaporated and the residue was precipitated from methanol-ether-hexane. The solid was 
filtered and dried: 6.8 g (104%) yield crude product (contains sodium acetate). 

(2R, 5R)-5-(4-amino-3-iodopyrazolo[3, 4-d]pyrimidinyl)-2-(hydroxymethyl)oxolan-3-ol (9). 

Compound 8 (6.8 g, 17.35 mmol) was stirred in 200 mL of concentrated 
ammonium hydroxide (sealed container) at room temperature for 36 h. The mixture was 
evaporated and the residue was precipitated from acetonitrile-ether. The solid was filtered 
and dried: 5.36 g (82%) yield. NMR and spectral data on this compound is identical to 

Phase 2 Preparation of 3-{4-((lE)-l-aza-2-methylprop-l-enyl)-l-((2R,5R)-5-{[Bis(4- 

methoxyphenyl)phenylmethoxy]rwthyl}-4-{lbis(methy^ 

cyanoethoxy)phosphinooxy}oxolan-2-yl)pyrazolo[3,4-dJpyrimidin-3-y^ 

methoxybenzoate (13; Rl=-OCOPhCH 3 ) 

3-{l-{(2RM)-4-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-4-amm^^ 
3-yl}prop-2-ynyl} 4-methylbenzoate (10; R 1 = -OCOPhCH 3 ) 

To a mixture of 9 (2.40 g, 6.37 mmol), Cul (124 mg, 0.648 mmol), Pd(PPh 3 ) 4 
(380 mg, 0.331 mmol), and triethylamine (1.32 mL) in 12 mL of anhydrous DMF was added 
prop-2-ynyl 4-methylbenzoate (1 .87 g, 1 1 .85 mmol). The mixture was stirred under argon 
for 12 h. and then evaporated to dryness. The residue was purified by silica gel 
chromatography eluting with 5% methanol in ethyl acetate. The pure product fractions were 
evaporated affording a foam: 2.29 g (85%) yield; TLC (10% methanol in ethyl acetate), R f = 
0.43; l H NMR (DMSO-de) § 8.26 (1H, s, 6-H), 7.92 and 7.37 (4H, 2 x d, J = 8.5 Hz, 
toluoyl-Hs), 6.55 (1H, t, J = 6.3 Hz, 1 '-H), 5.29 (2H, s, alkynl-CH 2 ), 5.28 (1H, d, J = 4.7 Hz, 
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3'-0H), 4.76 (1H, t, J = 5.7 Hz, 5'-OH), 4.41 (1H, m, 3'-H), 3.81 (1H, m, 4'-H), 3.49 and 
3.56 (2H, 2 x m, 5'-Hs), 2.76 and 2.24 (2H, 2 x m, 2'-Hs), 2.39 (3H, s, toluoyl- CH 3 ). 

HHOE)-l-aza-2-methylprop-l-enyl)-l-((2RM)-4-hydroxy-5-(hydroxynte 
yl]pwazolo[3,4-d]pyrimidin-3-yl}prop-2-ynyl4-methylbenzoate (11; R 1 = -OCOPhCH 3 ). 

Compound 10 (1.76 g, 4.16 mmol) was stirred in a solution of 5.0 mL of N,N- 
dimethylacetamide, 1.9 mL of ^Af-dimethylacetamide dimethyl acetal and 2.0 mL of 
anhydrous triethylamine for 15 h. at room temperature. The solvents were evaporated and the 
residue was evaporated twice from xylenes affording 11 as a foam. TLC (10% methanol in 
ethyl acetate), R f = 0.29; 'H NMR (DMSO-dfi) 5 8.5 1 (1H, s, 6-H), 7.90 and 7.37 (4H, 2 x 
d, J = 8.3 Hz, toluoyl protons), 7.28, 7.14 and 6.74 (13H, d and 2 m, J = 7.1 Hz for the 
doublet), trityl protons), 6.63 (1H, m, l'-H), 5.32 (1H, d, J = 5.0 Hz, 3'-OH), 5.24 (2H, s, 
alkynl-CH 2 ), 4.56 (1H, m, 3'-H), 3.93 (1H, m, 4'-H), 3.68 (6H, s, methoxy Hs), 3.32 and 3.11 
(6H, 2 x s, N-methyls), 3.09-2.95 (2H, m, 5'-Hs), 2.81 and 2.32 (2H, 2 x m, 2'-Hs), 2.40 (3H, 
s, = - Me), 2.19 (3H, s, toluoyl-Me). 

3-{4-((lE)-l-aza-2-methy!prop-l-enyl)-l-((2R,5R)-5-{[Bis(4- 
methoxyphenyl)phenylmethoxy]methyl}-4-hydroxyoxolan-2-yl)w^ 
ylJprop-2-ynyl 4-methylbenzoate (12; R 1 =-OCOPhCHj) 

To a solution of 11 in 25 mL of anhydrous pyridine was added dimethoxytrityl 
chloride (1.67 g). The resulting solution was stirred for 4 h. at room temperature and then 
poured into 250 mL of 5% sodium bicarbonate solution. The mixture was extracted with 
ethyl acetate and the extract was dried over sodium sulfate, filtered and evaporated. The 
residue was purified by silica gel chromatography eluting with 5% methanol in the ethyl 
acetate. The pure product fractions were pooled and evaporated affording 12 as a foam: 2.06 
g (62%) yield for the two-step process. 

3-{4-((lE)-l-aza-2-methylprop-l-enyl)-l-((2R,5R)-5-{[Bis(4- 

methoxyphenyl)phmylmethoxy]methyl}-4-{[bis(methylethyl)amino](2- 

cyanoethoxy)phosphinooxy}oxolan-2-yl)pyrazolo[3,4-d]pyrimidin^ 

methoxybenzoate (13; R 1 = -OCOPhCH 3 ) 

or, 
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l-[2-Deoxy-5-0-(4J'-dimethoxytriphenylme^ 
[(dimethylamino)prop^-enyl]-3-(prop-2^ 

d]pynmidine3 > -[(2-cyanoeihyl) N,N-diisopropylphosphoramidite] (13). 

To a solution of 12(2.03 g, 2.56 mmol) in 60 mL of anhydrous methylene 
5 chloride, containing 1.33 mL of iV;#-diisopropylethylamine was added chloro(2- 

cyanoethoxy)(AT,iV-diisopropylamino)phosphine (1 .0 mL) under argon. The solution was 
stirred for 1.0 h at room temperature and then treated with 2.0 mL of methanol. The solution 
was diluted with 250 mL of ethyl acetate and washed with 200 mL of 5% sodium 
bicarbonate solution. The organic phase was dried over sodium sulfate, filtered and 
1 0 evaporated. The residue was purified by silica gel chromatography eluting with 2% 
triethylamine in ethyl acetate. The pure fractions were pooled and evaporated. The 
phosphoramidite (13) was precipitated from ether-hexanes: 1.82 g (71%) yield; TLC (5 % 
methanol in ethyl acetate), R f = 0.32; 3l PNMR(DMSO-d 6 ) 8 147.90 and 147.22. 



15 

Example 3 

This example illustrates the preparation of N- {3-[l-((2R,5R)-5 {[bis(4- 
methoxyphenyl)phenylmethoxy]methyl} -4- {[bis(methylethyl)amino](2- 
cyanoethoxy)phosphmooxy}oxolan-2-yl)-6-^ 
20 3-yl)]propyl} -2,2,2-trifluoroacetamide (22) 

3-Iodo-4-methoxypyrazolo[3^-d]pynmidine-6-ylamine(14). 

4-Methoxy-pyrazolo[3,4-^pyrimidin^-ylamine (6.75 g, 40.87 mmol) was 
suspended in an aqueous solution containing sodium acetate (6.0 g, 44.09 mmol) and iodine 

25 monochloride (9.12 g, 56.17 mmol) and stirred at 100 °C in a sealed reaction vessel for 24 h. 
The resulting mixture was cooled to room temperature and treated with a solution of sodium 
metabisulfite (3.6 g, 18.94 mmol) in 24 mL of water. The solid that formed was filtered, 
rinsed with water and dried: 6.93 g (58%) yield; TLC (10% methanol in ethyl acetate), R f = 
0.57; l H NMR (DMSO-d 6 ) 5 13.08 (1H, br s, N-l proton), 10.58 (1H, s, N-5 proton), 6.60 

30 (2H, br s, 6-amino). 

[(2R t 5R)-5-(6-Amino~3-Iodo-4-methoxypyrazolo[ 3, 4-d]pyrimidinyl)-3-( 4- 
methylphenylcarbonyloxy)oxolan-2-yl]methyl 4-methylbenzoate (15). 
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To a suspension of (14) (6.68 g, 22.95 mmol) in 150 mL of methanol was added 8.05 
mL of a 2.85 M methanolic potassium hydroxide. The mixture was stirred for one minute 
and then diluted with 100 mL of toluene and evaporated. The solid potassium salt of 1 was 
dried under vacuum. The potassium salt was suspended in 75 mL of anhydrous DMF and 
then diluted with 420 mL of anhydrous acetonitrile. l-ChIoro-l,2-dideoxy-3,5-di-0- 
toluoylribomranose (8.95 g, 22.95 mmol) was immediately added and the reaction mixture 
was stirred at room temperature under argon for 1 h. and then filtered. The filtrate was 
evaporated and the residual foam was recrystallized twice from methanol: 6.59 g (45%) 
yield; TLC (1:1, ethyl acetate/hexane), R f = 0.52; 'H NMR (DMSO-de) 8 7.93 and 7.37 (8H, 
m and d, J = 7.9 Hz for the d, aromatic), 7.10 (2H, br s, 6-amino), 6.51 (1H, t, J = 6.6 Hz, F- 
H), 5.69 (1H, m, 3'-H), 4.44 (3H, m, 4'- and 5'-Hs), 3.99 (3H, s, 4-methoxy), 3.19 and 2.63 
(2H, 2 x m, 2'-Hs), 2.40 and 2.38 (6H, 2 x s, toluoyl methyl protons). 

[(2R,5R)-5-(6-Amino-3-Iodo^-methoxypyrazolo[3,4-d]pyrimidinyl)-2- 
(hydroxymethyl)oxolan-3-ol (16), 

To a suspension of 15 (32.4 g, 50.40 mmol) in 600 mL of methanol was added 12.5 
mL of 1M sodium methoxide in methanol. The reaction mixture was stirred at reflux for 18 
h. and then cooled in a freezer (-10 °C) . The crystals of 3 that formed were filtered and 
rinsed with ice-cold methanol: 10.95 g yield. Acetic acid (12.5 mL) was added to the filtrate 
and the volume was reduced to about 300 mL. The solution was allowed to sit in the freezer 
overnight and another crop of crystals (3.95 g) was collected. The filtrate was evaporated to 
dryness and the residual oil was triturated in ether. The solid that formed was filtered, dried 
and recrystallized from boiling water 2.03 g. Total yield = 16.93 g (83%); TLC (10% 
methanol in ethyl acetate), R f = 0.23; 'H NMR (DMSO-de) 5 7.02 (2H, br s, 6-amino), 6.33 
(1H, t, J = 6.1 Hz), 5.24 (1H, d, J = 4.4 Hz, 3'-OH), 4.73 (1H, t, J = 5.6 Hz, 5'-OH), 4.36 (1H, 
m, 3'-H), 3.99 (3H, s, 4-methoxy), 3.75 (1H, m, 4'-H), 3.51-3.32 (2H, m, 5'-Hs), 2.79 and 
2.17 (2H, 2 x m, 2'-Hs). Anal. Calcd. For C n H 14 IN 5 04 ' 0.3 H 2 0; C, 32.02; H, 3.57; N, 
16.98. Found C, 32.13; H, 3.35; N, 16.77. 

J-f(2RJR)-(5-hydroxyineihyl)oxolan-2-ylJ-6-amino-3-iodo-5-hydropyrazolof3,4- 
d]pyrimidin-4-one (17). 

A suspension of (16) (16.84 g, 41.36 mmol) in 1.5 L of 1 N sodium hydroxide 
solution was heated to reflux. The mixture turned into a homogenous solution and was 
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cooled to 25 °C using an ice-water bath. Acetic acid (90 mL) was added and the solution was 
stored at 5 °C overnight. The crystals that formed were filtered, rinsed with ice-cold water 
and dried: 7.47 g yield of 4. The filtrate was evaporated to a volume of about 500 mL and 
stored at 5 °C overnight. A second crop of crystals was collected (3.75 g). The filtrate was 

5 evaporated to about 200 mL. The sodium acetate that formed was filtered off and the filtrate 
was stored at 5 °C overnight. Another crop of crystal was collected (2.25 g). Total yield: 
13.47 g (83 %); TLC (5:3:2, n-butanol/water/acetic acid), R f - 0.74; 'H NMR (DMSO-d 6 ) 8 
6.22 (1H, t, J - 6.3 Hz, 1 '-H), 4.33 (1H, m, 3'-H), 3.73 (1H, m, 4'-H), 3.47 and 3.36 (2H, 2 x 
m, 5'-Hs), 2.63 and 2.12 (2H, 2 x m, 2'-Hs). Anal. Calcd. For C10H12IN5O4 * 0.7 H 2 0; C, 

10 29.60; H, 3.33; N, 17.26. Found C, 29.81; H, 3.02; N, 17.00. 

N-(3-{l-[(2R,5R)-4-(hydroxymethyl)oxalan-2-yl]-6-amino-4-oxo(5-hydro^ 
d]pyrimidin-3-yl)}prop-2-ynyl)-2,2,2-trifluoroacetamide(18). 

To a mixture of 17 (6.00 g, 15.26 mmol), Cul (297 mg, 1.56 mmol) and 

1 5 tetrakis[triphenylphosphine]palladium[0] in 30 mL of anhydrous DMF was added anhydrous 
triemylamine (3.14 mL) followed by propargyl trifluoroacetimide (4.29 g, 28.41 mmol). The 
reaction solution was stirred under argon for 40 h. The DMF was evaporated off and the 
residual oil was triturated in chloroform. The solid of crude 5 that formed was filtered, rinsed 
with chloroform and dried. The solid was dissolved in a minimum volume of DMF, absorbed 

20 onto silica gel and evaporated. The dry mixture was loaded onto a silica gel column and 
eluted with 10% methanol in ethyl acetate. The product fractions were pooled and 
evaporated. The residue was precipitated from ethyl acetate/ether: 4.0 g (63 %) yield; TLC 
(20% methanol in ethyl acetate), R f - 0.59; *H NMR (DMSO-d 6 ) 8 10.79 (1H, s, N 5 -H), 
10.16 (1H, t, J = 5.2 Hz, trifluoroacetimido N-H), 6.77 (2H, br s, 6-amino), 6.28 (1H, t, J = 

25 6.3 Hz, 1 '-H), 5.23 (1H, d, J - 4.1 Hz, 3'-OH), 4.72 (1H, t, J - 5.1 Hz, 5'-OH), 4.32 (3H, m, 
-CONH-Cffj_- and 3'-H), 3.75 (1H, m, 4'-H), 3.50-3.29 (2H, 2 x m, 5'-Hs), 2.65 and 2.15 
(2H, 2 x m, 2'-Hs). Anal. Calcd. For CsHsFjNeOj ' 0.74 H 2 0; C, 41.93; H, 3.87; N, 19.56. 
Found C, 42.33; H, 3.64; N, 19.13. 

30 N-(3-{l-[(2R, 5R)-4-(hydroxymethyl)oxalan-2-yl]-6-amino-4-oxo( 5-hydropyrazolof 5-4- 
d]pyriminin-3-yl)}propyl)-2,2,2-trifluoroacetamide(19). 

To a solution of (18) (1.0 g, 2.40 mmol) in 100 mL of methanol, containing 0.12 g of 
20% palladium hydroxide (preactivated with formic acid), was added 2.0 mL of 4 M 



73 



WO 03/107144 



PCT/US03/19447 



triethylammonium formate buffer (pH 6.5). The mixture was shaken under 35 psi of 
hydrogen gas for 18 h (during some runs it was necessary to add additional catalyst for 
complete reduction). The mixture was filtered through Celite and the filtrate was evaporated. 
The residual oil was crystallized from water: 0.79 g (78%) yield; TLC (20% methanol in 
ethyl acetate), R f = 0.52; ! H NMR (DMSO-d*) 8 10.59 (1H, s, N 5 -H), 9.47 (1H, br t, 
trifluoroacetimido N-H), 6.64 (2H, br s, 6-amino), 6.27 (1H, t, J = 6.3 Hz, l'-H), 5.18 (1H, d, 
J = 4.4 Hz, 3'-OH), 4.75 (1H, t, J = 5.9 Hz, 5>-OH), 4.36 (1H, m, 3'-H), 3.75 (1H, m, 4>-H), 
3.48 and 3.61 (2H, 2 x m, 5'-Hs), 3.22, 2.68 and 1.87 (6H, 3 x m, propyl methylene protons), 
2.68 and 2.12 (2H, 2 x m, 2'-Hs). Anal. Calcd. For dsH^NfiOs ' 0.90 H 2 0; C, 41.27; H, 
4.80; N, 19.25. Found C, 41.57; H, 4.50; N, 19.11. 

N-(3-[(lE)-l-aza-2-(dimethylamino)prop-^ 
amino-4-oxo(5-hydropyrazolo[5-4-d]pyrimM 

To a solution of (19) (0.80 g, 1.98 mmol) in 5.0 mL of anhydrous DMF was added 
N,N-dimethylformamide dimethylacetal (3.1 mL). The solution was stirred for 2.0 h. under 
argon and then evaporated. The residue was evaporated twice from xylenes and then placed 
under vacuum. The amorphous solid (20) that formed was triturated with ether and collected: 
773-mg (82%) yield; TLC (20% methanol in ethyl acetate), R f = 0.47; *H NMR (DMSO-d6) 
5 1 1 .22 (1H, br s, N 5 -H), 9.47 (1H, t, J = 5.5 Hz, trifluoroacetimido N-H), 8.67 (1H, s, 
N=€H-N), 6.42 (1H, t, J = 6.5 Hz, I'-H), 5.22 (1H, d, J = 4.3 Hz, 3'-OH), 4.75 (1H, t, J - 6.1 
Hz, 5'-OH), 4.40 (1H, m, 3'-H), 3.77 (1H, m, 4'-H), 3.50 and 3.38 (2H, 2 x s, 5'-Hs), 3.18 
and 3.05 (6H, 2 x s, N,N-dimethyl Hs), 3.22, 2.72 and 1.89 (6H, 3 x m, propyl methylene 
Hs), 2.72 and 2.15 (2H, 2 x m, 2'-Hs). Anal. Calcd. For CigH^NyOs ' 0.40 H 2 0; C, 44.80; 
H, 5.18; N, 20.32. Found C, 45.02; H, 4.96; N, 19.94. 

N-{ 3-f l-( (2R t 5R)-5-{[bis(4-methoxyphenyl)phenylmethoxy]m 
amino-4-oxo(5-hydropyrazolo[3J-d]pyrimidin-3-yl)]propy^^ 

To a solution of (20) (723-mg, 1.52 mmol) in 9.0 mL of anhydrous pyridine was 
added 4,4 , -dimethoxytrityl chloride (0.61 g, 1.80 mmol). The reaction solution was stirred 
for 3.0 h. under argon and then poured into 100 mL of 5% sodium bicarbonate solution. The 
aqueous solution was extracted with ethyl acetate (2 x 200 mL) and the dried extracts 
(sodium sulfate) were evaporated. The crude product was purified by silica gel 
chromatography eluting with a solvent gradient of 0-5% methanol in ethyl acetate (2% 
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triethylamine). The product fractions were evaporated affording 21 as an amorphous solid: 
724-mg (61%) yield; TLC (5% methanol in ethyl acetate), R f = 0.39; ! H NMR (DMSO-d 6 ) 8 
11.27 (1H, s,N 5 -H), 9.43 (1H, t, J = 5.3Hz, trifluoroacetimido N-H), 8.71 (1H, s,N=CH-N), 
7.32, 7.17 and 6.76 (13H, 3xm, aromatic), 6.45 (1H, t, J = 6.3 Hz, l'-H), 5.26 (1H, d, J = 5.3 
5 Hz, 3'-OH), 4.45 (1H, m, 3'-H), 3.90 (1H, m, 4'-H), 3.70 (6H, s, OMe Hs), 3.18 and 3.05 
(10H, 2 x s, N,N-dimethyl, 5'-Hs and CONH-C^), 2.62 and 1.65 (4H, 2 x m, methylene Hs), 
2.62 and 2.20 (2H, 2 x m, 2'-Hs). Anal. Calcd. For C39H42F3N7O7 ' 0.30 H 2 0; C, 59.81; H, 
5.48; N, 12.52. Found C, 59.80; H, 5.39; N, 12.63. 



10 N-{3-[l-((2R,5R)-5{[bis(4-methoxyphenyl)phenylm^^ 
{jbis(methylethyl)amino](2-cyanoethoxy)phosphinoo 
hydropyrazolof 3, 4-d] pyrimidin-3-yl) Jpropyl}-2, 2, 2-trifluoroacetam ide (22 .) 

To a solution of (21) (700-mg, 0.900 mmol) in 22 mL of anhydrous methylene 
chloride, containing 0.47 mL of diisopropylethylamine, was added 2-cyanoethyl N,N- 

15 diisopropylchlorophosphoramidite (0.34 mL, 1.52 mmol). After stirring for 30 minutes under 
argon at 25 °C the solution was treated with 3.0 mL of methanol and diluted with 200 mL of 
ethyl acetate. The solution was washed with 100 mL of 5% sodium bicarbonate solution and 
dried over sodium sulfate and evaporated. The crude product was purified by silica gel 
chromatography eluting with 2% triethylamine in ethyl acetate. The product fractions were 

20 evaporated and the residue was precipitated from ether - hexane: 583-mg (66%) yield; TLC 
(ethyl acetate), Rf = 0.38; 31 P NMR (DMSO-d 6 , reference to 85% phosphoric acid) 5 145.50 
and 144.72. 



Example 4 

25 This example illustrates the preparation of 3-{[5-(4,6-bis{(lE>l-aza-2-[bis(2- 

methylpropyl)amino] vinyl } -3-prop- 1 -ynylpyrazolo[3,4-d]pyrimidinyl)-2- { [bis(4- 
methoxyphenyl)phenylmethoxy]-methyl}oxolan^-yloxy][ethyl(methylethyl)amino}- 
phosphinojpropanenitrile (26) 

30 5-(4,6-bis{(lE)'l-aza-2-[bis(2-methylpropyl)amino]vinyl}^^ 
d]pyrimidinyl)-2 (hydroxymethyl)oxolan-3-ol (24) 

Compound (23) (1 mmol) can be stirred for 5 hr at room temperature with 
(dimethoxyraethyl)bis(2-methylpropyl)amine (0.5 mL, 3.37 mmol). The reaction mixture can 
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be reduced in vacuo, and the protected compound can be purified on silica gel to yield (24) 
(Vincent et al). 

5-(4 t 6-bis{(lE)-l-aza-2-[bis(2-methylpropyl)amino]\nn^ 
d]pyrimidinyl)-2-{[bis(4-methoxyphenyl)pheny 

To a solution of (24) (1.50 mmol) in 9.0 mL of anhydrous pyridine 4,4'- 
dimethoxytrityl chloride (0.61 g, 1.80 mmol) can be added. The reaction solution can be 
stirred for 3.0 h. under argon and then can be poured into 100 mL of 5% sodium bicarbonate 
solution. The aqueous solution can be extracted with ethyl acetate (2 x 200 mL) and the 
extracts can be dried(sodium sulfate) and evaporated. The crude product can be purified by 
silica gel chromatography with an appropriate gradient to yield (25). 

3-{[5-(4, 6-bis{(lE)-l-aza-2-[bis(2-methylpropyl)amino]vinyl}-3-^^ 4- 

d]pyrimidinyl)-2-{[bis(4-methoxyphenyl)phe^ 

yloxy] [ethyl(methylethyl)amino}phosphino}propanenitrile (26) 

To a solution of (25) ( 0.900 mmol) in 22 mL of anhydrous methylene 
chloride, containing 0.47 mL of diisopropylethylamine, 2-cyanoethyl N,N- 
diisopropylchlorophosphoramidite (0.34 mL, 1.52 mmol) can be added. The solution can be 
stirred for 30 minutes under argon at 25 °C and the can be treated with 3.0 mL of methanol 
and diluted with 200 mL of ethyl acetate. The solution can then be washed with 100 mL of 
5% sodium bicarbonate solution and dried over sodium sulfate and evaporated. The crude 
product can be purified by silica gel chromatography with an appropriate gradient to yield 
(26). 

Example 5 

This example illustrates the synthesis of 5-[4,6-diamino-3-(2- 
methoxyethynyl)pyrazolo[3,4-d]pyri^ (27). 

5-[ 4, 6-diamino-3-(2-methoxyethynyl)pyrazolo[ 3, 4-d]pyrimidinyl] -2-(hydroxymethyl)oxolan- 
3 -ol (27) 

To a mixture of 4 J 6-diamino-l-(2-deoxy-P-D-ery^rc)-pentofuranosyl)-3-iodo- 
li/-pyrazolo[3,4-d]pyrimidine (4.2 g, 10.71 mmol), Cul (211 mg, 1.10 mmol), Pd(PPh 3 ) 4 
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(635 mg, 0.553 mmol), and triethylamine (2.2 mL) in 20 mL of anhydrous DMF was added 
methyl propargyl ether (1 .82 mL). The mixture was stirred under argon for 16 h. and then 
evaporated to dryness. The residue was crystallized from methanol: 3.20 g (89%) yield; 
TLC (30% methanol in ethyl acetate), R f = 0.63; ! H NMR (DMSO-d 6 ) 6 7.44 (2H, br d, 
amino), 6.32 (1H, t, J - 6.6 Hz, l'-H), 6.33 (2H, br s, amino), 5.20 (1H, d, J = 4.1 Hz, 3'- 
OH), 4.75 (1H, br t, 5'-OH), 4.40 (2H, s, methylene), 4.36 (1H, m, 3'-H), 3.76 (1H, m, 4'-H), 
3.47 & 3.32 (2H, 2 x m, 5'-Hs), 3.32 (3H, s, methoxy), 2.68 & 2.14 (2H, 2 x m, 2'-Hs). 

Synthesis of 5-{6-[(lE)-l-aza-2-(dimethylamino)vinyl] 

(dimethylamino)vinyl]pyrazolo[3 t 4-d]pyrim /- 
methoxyprop-l-yne (28). 

Compound 27 (3.1 g, 9.28 mmol) was stirred in a solution of 30 mL of N,N~ 
dimethylformamide and 15 mL of M^-dimethylformamide dimethyl acetal for 14 h at 45 °C. 
The solvents were evaporated and the residue was evaporated twice from xylenes affording 
28 as a foam which was precipitated from ethyl acetate - ether: 2.8 g (68%) yield; TLC (50% 
methanol in ethyl acetate), R f = 0.36. 

Synthesis of 3-[(5-{6-[(lE)-l-aza-2-(dimethylam 

(dimethylamino)\nnylJpyrazolo[3^Jpyrimidinyl}^ 

metho^Jmethyl}oxolan-3'yloxy)[bis(methylethyl)aminoJphospM 

To a solution of 28 (2.7 g, 6.08 mmol) in 45 mL of anhydrous pyridine was 
added dimethoxytrityl chloride (2.4 g). The resulting solution was stirred for 3 h. at room 
temperature and then poured into 200 mL of 5% sodium bicarbonate solution. The mixture 
was extracted with ethyl acetate and the extract was dried over sodium sulfate, filtered and 
evaporated. The residue was purified by silica gel chromatography eluting with a gradient of 
0-40% methanol in ethyl acetate. The pure 5 '-O-DMT derivative fractions were pooled and 
evaporated affording a foam: 1.0 g (22%) yield. 

To a solution of DMT derivative (0.98 g, 1.31 mmol) in 16 mL of anhydrous 
methylene chloride, containing 0.70 mL of A^diisopropylethylamine was added 2- - 
cyanoethyl diisopropylchlorophosphoramidite (0.50 mL) under argon. The solution was 
stirred for 30 min at room temperature and then treated with 1.0 mL of methanol. The 
solution was loaded directly on to a silica gel column and eluted with a gradient of 0-20% 
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methanol in ethyl acetate (2% triethylamine). The pure fractions were pooled and evaporated 
affording a foam: 0.25 g (20%) yield. 

Example 6 

5 

This example illustrates the preparation of 3-[(5-{4-[(lZ)-l-aza-2- 
(methylamino)vinyl]-3-(2-ftiryl)^ -2- {[bis(4- 

methoxyphenyl)phenylmethoxy]methyl} oxolan-3-yloxy)[bis(methylethyl)amino] - 
phosphinooxyjpropanenitrile (35) (See Reaction Scheme 8) 

10 

(2-Furylmethoxymethylene)methane-l t 1-dicarbonitrile (30) 

To an ice-cold solution containing anhydrous methylene chloride (500 mL), 
triethylamine (100 mL), and malonitrile (30g, 454 mmoi) was added 2-furanyl chloride (50 g, 
383 mmol) by dropwise addition over a 20 min period. Stirring was then continued for an 
1 5 additional hour at ambient temperature. The reaction solution was washed with 1 .5 L of ice- 
cold 2N HC1 solution and then with 1 .5 L of water. The organic phase was evaporated 
affording an oil. 

A portion of the product oil (4.6 g, 28.75 mmol) was dissolved in a solution 
consisting of 40 mL of dioxane and 4.0 mL of water. Dimethyl sulfate (15 mL) and sodium 

20 bicarbonate (15g) were added and the reaction solution was stirred for 2.5 h at 80 °C. The 

mixture was then dissolved in 100 mL of water and the product was extracted with 200 mL of 
ethyl acetate. The organic solution was washed with 100 mL of 5% sodium bicarbonate 
solution followed by 100 mL of water. The solution was dried over sodium sulfate, filtered 
and evaporated affording an oil which solidified under vacuum: 2.72 g yield; TLC (1:1, 

25 ethyl acetate/hexane), R f = 0.42; *H NMR ((DMSO-^) 5 8.25 (1H, m, aromatic), 7.63 (1H, 
m, aromatic), 6.92 (1H, m, aromatic), 4.23 (3H, s, methoxy). 

2-Amino-4-(2-furyl)juran-3-carbonitrile (31) . 

Hydrazine monohydrate (3.4 mL, 80 mmol) was added dropwise to an ice- 
30 cold solution of 30 (2.72 g, 15.63 mmol) in 75 mL of methanol over a period of 15 min. The 
solution was then evaporated to dryness affording an oil which solidified under vacuum. The 
solid was triturated in ether, filtered and dried: 2.2 g (81%) yield; TLC (ethyl actate), Rf = 
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0.81; ] H NMR (DMSO-4?) 8 7.77 (1H, s, furanyl), 6.80 (1H, m, furanyl), 6.61 (1H, ra, 
furanyl), 6.41 (2H, br s, amine). 

3-(2-furyl)pyrazolo[3A'd]pyrimidi7ie-4-ylamine (32). 
5 Compound 31 (25.7g, 148 mmol) was stirred in 250 mL of formamide at 190 

°C for 4 h. The solution was then cooled to room temperature, diluted with 1 .2 L of water 
and chilled in an ice-bath. The solid that formed was filtered and dried: 22 g (74%) yield; 
TLC (5% methanol in ethyl actate), R f = 0.25. 

10 [5-(4-Amino-3-(2-furyl)pyrazolo[3,4-d]pyrim^ 
oxoethyl]oxolan-2-yl] methyl 4-methylbenzoate (33). 

Compound 32 (10 g, 49.7 mmol) was stirred in 200 mL of 0.29 M methanolic 
KOH solution for 5 min. The mixture was evaporated to dryness and the residue was then 
dissolved in 40 mL of hot anhydrous DMF. The solution was cooled to room temperature 

15 and then diluted with 230 mL of anhydrous acetonitrile. The chlorosugar derivative (23 g, 
59.14 mmol) was added immediately and the mixture was stirred for 45 min and then 
evaporated to dryness. The residue was dissolved in 800 mL of ethyl acetate and washed 
with water (2 x 800 mL). The organic solution was dried over sodium sulfate, filtered and 
evaporated. The crude product was purified by silica gel chromatography eluting with a 

20 gradient of 30-0% hexane in ethyl acetate. The pure product fractions were evaporated and 
the residue was crystallized from hot methanol: 3.4 g (12%) yield; TLC (ethyl actate), R f = 
0.47; l H NMR ((DMSO-^) 5 8.27 (1H, s, 6-H), 7.96 (1H, m, furanyl), 7.95, 7.84, 7.36 & 
7.23 (8H, 4 x d, toluoyl aromatic), 6.83 (1H, m, fiiranyl), 6.80 (1H, t, J=6.3 Hz, l'-H), 6.70 
(1H, m, furanyl), 5.86 (1H, m, 3'-H), 4.64-4.42 (3H, m, 4'-H and 5'-Hs), 3.28 & 2.76 (2H, 2 

25 x m, 2'-Hs), 2.39 & 2.36 (6H, 2 x s, toluoyl methyls). 

5-{4-[(lZ)-l-aza-2-(dimethylaimno)v^ 
(hydroxymethyl)oxolan-3-ol (34). 

A suspension of 33 (3.36 g, 6.08 mmol) in 17 mL of 0.12 M methanolic 
30 sodium methoxide solution was refluxed for 30 min. The resulting solution was cooled to 
room temperature and neutralized by addition of 0. 12 mL of acetic acid. The solution was 
evaporated to dryness and the product was precipitated from methanol-ether and then 
recrystallized from boiling water: 1.63 g (85%) yield of the deprotected nucleoside. 
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All of the nucleoside product from above was stirred in a solution consisting 
of 30 mL of anhydrous DMF and 15 mL of N,N-dimethylformamide dimethylacetal for 5 h. 
The solution was evaporated to dryness and the residue was then evaporated twice from 
xylenes affording a foam: 1.89 g(99%) yield; TLC (20% methanol in ethyl actate),R f = 
5 0.45; 'H NMR ((DMSO-rf*) 8 8.98 & 8.46 (2H, 2xs, formamidine C-H and 6-H), 7.97 
(1H, m, furanyl), 7.81 (1H, m, fiiranyl), 6.65 (2H, m, l'-H and fiiranyl), 5.30 (1H, d, J = 4.5 
Hz, 3 5 -OH), 4.79 (1H, t, J = 5.5 Hz, 5'-OH), 4.51 (1H, m, 3'-H), 3.85 (1H, m, 4'-H), 3.57 & 
3.40 (2H, 2 x m, 5'-Hs), 3.26 & 3.21 (6H, 2 x s, N-Me protons), 2.88 & 2.30 (2H, 2 x m, 2'- 
Hs). 

10 

3-[(5-{4-[(lZ)-l-aza-2-(methylamino)vinyl]-3-(2fa 
{[bu(4-methoxyphenyl)phenylmethoxy]methy 
phosphinooxyjpropanenitrile (35). 

Dimethoxytrityl chloride (2.01 g) was added to a solution of 34 (1.84 g, 4.95 

15 mmol) dissolved in 30 mL of dry pyridine. The reaction solution was stirred for 3 h at room 
temperature and then poured into 200 mL of 5% sodium bicarbonate solution. The product 
was extracted with 300 mL of ethyl acetate and the organic phase was dried over sodium 
sulfate and evaporated. The residue was purified using silica gel chromatography eluting 
with a gradient of 0-5% methanol in ethyl acetate. The pure product fractions were 

20 evaporated affording a foam: 2.26 g (68%) yield of the 5'-0-DMT derivative. 

To a solution of the DMT derivative (2.18 g, 3.23 mmol) dissolved in 40 mL 
of dry methylene chloride, containing 1.75 mL of N,N-diisopropylethylamine, was added 
1.25 mL of 2-cyanoethyl diisopropylchlorophosphoramidite. The solution was stirred under 
argon for 1 5 min at room temperature and then treated with 5 mL of methanol. The resulting 

25 solution was diluted with 500 mL of ethyl acetate and washed with 400 mL of 5% sodium 
bicarbonate solution. The organic solution was dried over sodium sulfate, filtered and 
evaporated. The residue was purified by silica gel chromatography eluting with a gradient of 
5-10 % methanol in ethyl acetate (2% triethylamine). The pure product fractions were 
evaporated affording a foam: 1.62g(57%)yield; 31 PNMR(DMSO-^) 5 147.81 and 

30 147.16. 
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Example 7 



This example illustrates the preparation of 3-[(5-{6-[(lE)-l-aza-2- 
(dimethylamino) vinyl] -4-[( 1 Z)- 1 -aza-2-(dimethylamino)vinyl]-3-(2-fuiyl)pyrazolo[3,4- 
5 d]pyrimidinyl} -2- {[bis(4-methoxyphenyl)phenylmethoxy]methyl} oxolan-3- 

yloxy)[bis(methylethyl)amino]phosphinooxy]propanenitrile (39, See Reaction Scheme 9) 



3-(2-furyl)pyrazolo[3, 4-d]pyrimidine-4, 6-diamine (36) 

A mixture of finely ground 31 (10.0 g, 57.47 mmol) and guanidine carbonate 

10 (16.6 g, 91.95 mmol) was heated at 230 °C for 45 min. The mixture was cooled to room 

temperature and the solid was triturated in 100 mL of boiling water. The solid (pure 36) was 
filtered, rinsed with water and dried: 1 1.1 g (89%) yield; TLC (40% methanol in ethyl 
actate),R f =0.66; l H NMR ((DMSO-40 8 12.68 (1H, br s, N-H), 7.86 (1H, m, furanyl), 
6.93 (2H, br s, -NH 2 ), 6.85 (1H, d, J = 3.5 Hz, furanyl), 6.65 (1H, m, furanyl), 6.10 (2H, br s, 

15 -NH 2 ). 

5-(4,6-diamino-3-(2-furyl)pyrazolo[3 t 4~d]pyrimidm^ (37) 

Compound 36 (10.5 g, 48.61 mmol) was stirred in 200 mL of 0.29 M 
methanolic KOH solution for 5 min. The mixture was evaporated to dryness and the residue 

20 4 was then dissolved in 105 mL of hot anhydrous DMF. The solution was cooled to room 
temperature and then diluted with 620 mL of anhydrous acetonitrile. The chlorosugar 
derivative (23 g, 59.14 mmol) was added immediately and the mixture was stirred for 40 min 
and then filtered. The filtrate was evaporated to dryness and the residue was 
chromatographed through a silica gel column eluting with ethyl acetate. The nucleoside 

25 fractions were evaporated affording 2.8 g (10 % yield) of the toluoyl protected nucleoside as 
a mixture of alpha- and beta-anomers. This mixture was refluxed in 40 mL of 0.19 M 
methanolic sodium methoxide for 45 min. The reaction solution was placed in a freezer 
overnight producing a crop of crystals corresponding to the pure beta-anomer 37: 690 mg 
yield; TLC (20% methanol in ethyl actate), R f = 0.32; l H NMR (DMSO-<k) 6 7.90<1H, m, 

30 furanyl), 6.99 (2H, br s, amino), 6.86 (1H, d, J = 4.1 Hz, furanyl), 6.68 (1H, m, furanyl), 6.41 
(1H, t, J = 6.6 Hz, l'-H), 6.26 (2H, br s, amino), 5.21 (1H, d, J - 4.4 Hz, 3'-OH), 4.82 (1H, t, 
J = 5.8 Hz, 5'-OH), 4.42 (1H, m, 3'-H), 3.79 (1H, m, 4'-H), 3.52 & 3.41 (2H, 2 x m, 5'-Hs), 
2.75 & 2.18 (2H, 2 x m, 2'-Hs). 
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5-{6-[(lE)-l-aza-2-(dimethylamino)vinyl]-4-[(lZ)-l-am-2-(dimethylam^ 
furyl)pyrazolo[3jHiJpynmidinyl}-2-(hydroxymethyl)oxolan-3-ol(3S). 

A solution of 37 (0.68 g, 2.05 mmol) in 30 mL of anhydrous DMF and 15 mL 
of N,N-dimethylformamide dimethylacetal was stirred at room temperature for 24 h. The 
solution was evaporated to dryness and the residue was evaporated twice from xylenes 
affording a foam: 0.90 g (99%) yield; TLC (50 % methanol in ethyl actate), R f = 0.38; l H 
NMR (DMSO-ck) 5 8.93 & 8.77 (2H, 2 x s, formamidine C-Hs), 7.94 (1H, m, furanyl), 
7.76 (1H, m, furanyl), 6.62 (1H, m, furanyl), 6.60 (1H, t, J = 6.6 Hz, l'-H), 5.25 (1H, d, J - 
4.4 Hz, 3'-OH), 4.84 (1H, t, J = 5.9 Hz, 5'-OH), 4.47 (1H, m, 3'-H), 3.83 (1H, m, 4'-H), 3.56 
& 3.41 (2H, 2 x m, 5'-Hs), 3.25, 3.18, 3.16 & 3.03 (12H, 4 x s, N-methyls), 2.82 & 2.22 (2H, 
2xm,2'-Hs).] 

H(HH(lE)-l-aza-2-(dimethylamino)vinyl]-4-[(lZ)-lHiza-2-(dime^ 

furyl)pyrazolo[3,4-d]pyrimidinyl}-2-{[bis(4-methoxypliaiyl)ph^ 

yloxy)[bis(methylethyl)amino]phosphinooxy]propanenitrile(39). 

Dimethoxytrityl chloride (0.85 g) was added to a solution of 38 (0.90 g, 2.04 
mmol) in 12 mL of dry pyridine. The reaction solution was stirred for 2 h at room 
temperature and then poured into 200 mL of 5% sodium bicarbonate solution. The product 
was extracted with ethyl acetate (3 x 300 mL) and the pooled organic extracts were dried over 
sodium sulfate and evaporated. The residue was purified using silica gel chromatography 
eluting with a gradient of 20-30% methanol in ethyl acetate. The pure product fractions were 
evaporated affording a foam: 286-mg yield of the 5'-0-DMT derivative. 

To a solution of the DMT derivative (286 mg, 0.384 mmol) dissolved in 5 mL 
of dry methylene chloride, containing 0.23 mL of N,N-diisopropylethylamine, was added 
0.17 mL of 2-cyanoethyl diisopropylchlorophosphoramidite. The solution was stirred under 
argon for 15 min at room temperature and then treated with 0.5 mL of methanol. The 
resulting solution was diluted with 100 mL of ethyl acetate and washed with 75 mL of 5% 
sodium bicarbonate solution. The organic solution was dried over sodium sulfate, filtered 
and evaporated. The residue was purified by silica gel chromatography eluting with a 
gradient of 0-30 % methanol in ethyl acetate (2% triethylamine). The pure product fractions 
were evaporated affording a foam: 230 mg (12%) yield; 31 P NMR (DMSO-4$) 8 147.77 
and 147.08. 
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Example 8 

This example illustrates the preparation of 4-[l-(5-{[bis(4- 
methoxyphenyl)phenylmethoxy]methyl} -4- {[bis(methylethyl)amino](2- 
cyanoethoxy)phosphinooxy}oxolan-2-yl)-2^ 
methylbenzoate (see, 42 Reaction Scheme 10) 

Synthesis of3-Butyn-l-yl p-Toluate (40) 

In a 500 mL round bottom flask equipped with a magnetic stirring bar and 
under inert atmosphere, 3-butyn-l-ol (50.3 g, 0.718 mol) was dissolved with anhydrous 
pyridine (200 mL) and the solution was cooled in an ice water bath. To the cold solution, 
using an addition funnel, p-Xo\uy\ chloride (136.6 g, 0.86 mol, 1.2 eq) was added dropwise 
and the reaction mixture was stirred at room temperature overnight. 

To the mixture were added diethyl ether (350 mL) and water (100 mL). The 
organic layer was separated, and the aqueous solution was washed with diethyl ether (150 
mL). The organic fractions were combined and washed with 10% HC1 (3 x 100 mL), 
saturated NaHCC>3 solution (2 x 100 mL) and water (1 x 50 mL). The resulting solution was 
dried over anhydrous Na2SC>4, filtered and the solvent was removed to give 142 g 
(quantitative yield) of (40) as an off-white solid. The product can be recrystallized form 
hexane or methanol, but was sufficiently pure to proceed to the next step, 

*H NMR (CDCI3, 300 MHz) 8 (ppm) 7.95 (d, 2H, / « 8.1 Hz), 7.24 (d, 2H, J 
= 8. 1 Hz), 4.41 (t, 2H, J = 6.8 Hz), 2.67 (dd, 2H, Jj =2.6 J 2 = 6.8 Hz), 2.40 (s, 3H), 2.03 (t, 
1H, .7=2.4 Hz). 

5 '-DMT-5-[4-(p-toluyloxy)butynyl]-2 '-deoxyurdine (41) 
A mixture of 5-iodo-2'-deoxyuridine (4.0 g, 11.30 mmol), 4-(p- 
toluyloxy)butyne (40) (5.7 g, 30.3 mmol), Cul (222 mg, 1.16 mmol), 
tetrakis[triphenylphosphine]palladium[0] (0.67 g, 0.583 mmol) and triethylamine (2.0 mL) 
was stirred in 30 mL of anhydrous DMF under argon for 16 h. The mixture was evaporated 
to dryness and the residual oil solidified after stirring in 100 mL of ether. This solid was 
filtered and dried under vacuum. 
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A portion of the erode product (1.5 g) was dissolved in 20 mL of anhyrdrous 
pyridine and treated with dimethoxytrityl chloride (1.3 g). The solution was stirred for 2.0 h. 
under argon and then poured into 150 mL of 5% sodium bicarbonate solution. The product 
was extracted with 200 mL of ethyl acetate. The extract was dried over sodium sulfate, 
filtered and the solvent evaporated. The residue was purified by silica gel chromatography 
eluting with a gradient of 30-0% hexane in ethyl acetate). The product fractions were 
evaporated, affording a foam: 957-mg yield. TLC (30% hexane in ethyl acetate), 0.37; 
'H NMR (DMSO-ds) 8 1 1 .66 (1H, s, N-H), 7.9 - 6.8 (18H, aromatic protons), 6. 1 1 (1H, t, J 
= 6.6 Hz, l'-H), 5.35 (1H, br s, 3'-OH), 4.29 (1H, m, 3'-H), 4.15 (2H, t, /= 6.8 Hz, butyne 
methylene), 3.92 (1H, m, 4'-H), 3.71 (6H, s, methoxy groups), 3.24 & 3.07 (2H, 2 x m, 5'- 
Hs), 2.64 (2H, t, J= 6.8 Hz, butyne methylene), 2.37 (3H, s, toluyl methyl), 2.23 (2H, m, 2'- 
Hs). 

4-[l-(5-{[bis(4-methoxyphenyl)phenylmethoxy]methyl}-4- 
{[bis(methylethyl)amino](2-cyanoethoxy)phosphinooxy}o^ 
dioxo-l,3-dihydropyrimidin-5-yl]but-3-ynyl 4-methylbenzoate (42) 
To a solution of the DMT derivative from above (0.92 g, 1.28 mmol) in 15 mL 
of anhydrous methylene chloride, containing 0.75 mL of Wdiisopropylemylamine, was 
added 2-cyanoethyl diisopropylchlorophosphoramidite (0.56 mL). The solution was stirred 
for 30 min at room temperature under argon and then treated with 1.0 mL of methanol. The 
solution was evaporated down to about a 5 mL volume and loaded directly on to a silica gel 
column and eluted with 40% hexane in ethyl acetate (2% triethylamine). The pure fractions 
were pooled and evaporated, affording a foam: 0.91 g of (42) (78%) yield; 3I P NMR 
PMSO-d 6 ) 8 147.72 & 147.39. 

Example 9 

This example illustrates the enhanced ability of modified oligonucleotides to 
discriminate between related target sequences. 

In this example, modified oligonucleotides were prepared containing PPPA, 
PPG and a minor groove binder. As Figure 3 illustrates, the use of a MGB in combination 
with PPPA and PPG increases the T m and allows the design of shorter probes compatible 
with PCR extension temperatures. These modified oligonucleotides also yield increased 
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mismatch discrimination. In a first design of a 18-mer fluorescein-ODN-Red 13quencher- 
MGB real-time PCR probe, poor discrimination of an A/T mismatch was observed as shown 
in Figure 3 A. Re-design of the probe against the opposite strand, putting the mismatch under 

the MGB and substituting PPPA for A and PPG for G as indicated, allows shortening of the 
probe to a 1 5-mer. This probe now gives good mismatch discrimination as shown in Figure 
3B. 

A thermodynamic investigation of mismatch discrimination was performed on 
a set of oligonucleotides hybridized to a set of targets perfectly match or containing a single 
mismatch. The target sequences contain a) normal As, b) PPPA, c) normal As and a 3'-MGB 
and d) PPPA and a 3*-MGB, respectively. The sequences for the probes and targets are 
shown below in Table 3a and 3b. The determination of T m s and the calculation of 



Table 3a and 3b 
Probe and target sequences 



A. Probe Sequences - Mismatch Underlined 


Number 


Mismatch 


Probe Sequence 


1 


Complement 


AAAGTTATGTCTACTTACAGAAA 


2 


A/C 


AAAGCTATGTCTACTTACAGAAA 


3 


A/C 


AAAGTCATGTCTACTTACAGAAA 


4 


T/G 


AAAGTTGTGTCTACTTACAGAAA 


5 


A/C 


AAAGTTACGTCTACTTACAGAAA 


6 


C/A 


AAAGTTATATCTACTTACAGAAA 


7 


A/C 


AAAGTTATGCCTACTTACAGAAA 


8 


G/T 


AAAGTTATGTTTACTTACAGAAA 


9 


A/C 


AAAGTTATGTCCACTTACAGAAA 


10 


T/G 


AAAGTTATGTCTGCTTACAGAAA 


11 


G/T 


AAAGTTATGTCTATTTACAGAAA 


12 


A/C 


AAAGTTATGTCTACCTACAGAAA 


13 


A/C 


AAAGTTATGTCTACTCACAGAAA 


14 


T/G 


AAAGTTATGTCTACTTGCAGAAA 


B. Target Sequences - A'=PPPA 


1 


GTAAGTAGACATAAC 


2 


GTA'A'GTA'GA'CA'TA'A'C 


3 


GTAAGTAGACATAAC-MGB 


4 


GTA'A'GTA'GA'CA'TA'A'C-MGB 



85 



4 4 > 

WO 03/107144 PCT/US03/19447 



Table 3c 

Thermodynamic comparison of mismatch discrimination in terms of free energy 
5 increment at 50°C where AAG^^R^nOKmatch/Kmismatch) 



Number 


Mismatch 


A 


PPPA 


MGB 


MGB+PPPA 


2 


A/C 


2340 


2930 


2870 


5320 


3 


A/C 


2560 


3280 


4100 


6320 


4 


T/G 


1950 


1810 


4200 


5900 


5 


A/C 


3520 


3760 


3830 


4980 


6 


C/A 


5030 


5340 


4190 


5970 


7 


A/C 


3000 


3370 


4310 


5260 


8 


G/T 


3040 


3260 


3070 


4820 


9 


A/C 


3290 


3440 


3810 


5630 


10 


T/G 


1800 


1950 


2090 


3350 


11 


G/T 


3340 


3120 


3630 


5070 


12 


A/C 


2940 


3620 


2550 


4490 


13 


A/C 


2360 


3210 


1820 


3980 


14 


T/G 


1600 


2010 


2000 


2480 



AAG°5o is described in Example 9. Table 3c clearly shows increased mismatch 
10 discrimination when PPPA is substituted for A and even larger discrimination when PPPA is 
combined with a MGB. 

Comparison of the thermodynamic discrimination of mismatched base pairs 
formed in MGB-ODNs containing HO-PPPA/HO-PU with PPPA/PU at 37°C is shown in 
Table 4. The ODNs containing the modified bases in combination with MGB were 
1 5 hybridized to their complements. Mismatches are underlined in the sequences shown in 
Table 4. As shown in this table, HO-PPPA and HO-PU substitution compared to PPPA and 
PU, shown for the most part increased mismatch discrimination. 
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Table 4 



Comparison of thermodynamic discrimination of mismatched base pairs formed by 
HOPPPA or HOPU vs PPPA and PU in the 8-mer duplexes (+MGB). 





Sequence of Duplex 


PPPA/ 
PU 


HOPPPA/ 
HOPU 




Sequence of 


PPPA/ 
PU 


HOPPPA/ 
HOPU 






AAG 
cal/mol 


AAG 
cal/mol 






AAG 
cal/mol 


AAG 
cal/mol 


Match 


CGUCACUG-MGB 
AGCTGTGACT 






Match 


UAUUAUUG-MGB 
AATAATAACC 






1 


CGUCACUG-MGB 
AGCTGTGACT 


4250 


4350 


10 


UAUUAUUG-MGB 
AATTATAACC 


4400 


5000 


2 


CGUCACUG-MGB 
AGCGGTGACT 


3450 


3540 


11 


UAUUAUUG-MGB 
AATGATAACC 


3740 


3760 


3 


CGUCACUG-MGB 
AGCCGTGACT 


4860 


4530 


12 


UAUUAUUG-MGB 
AATCATAACC 


6630 


6840 


4 


CGUCACUG-MGB 
AGCAGAGACT 


4870 


4850 


13 


UAUUAUUG-MGB 
AATAAAAACC 


5090 


5730 


5 


CGUCACUG-MGB 
AGCAGGGACT 


4190 


4360 


14 


UAUUAUUG-MGB 
AATAAGAACC 


5920 


6520 


6 


CGUCACUG-MGB 
AGCAGCGACT 


3930 


3940 


15 


UAUUAUUG-MGB 
AATAACAACC 


4120 


4530 


7 


CGUCACUG-MGB 
AGCAATGACT 


2600 


2300 










8 


CGUCACUG-MGB 
AGCATTGACT 


4360 


4210 










9 


CGUCACUG-MGB 
AGCACTGACT 


4420 


4610 











AAG was calculated at 37°C. 



Example 10 

This example illustrates the UV melting studies carried out using 
oligonucleotides of the present invention. 

Hybrids formed between unmodified ODNs or MGB-probes and their 
complements were melted at a rate of 0.5°C/min, on a Lambda 2S (Perkin-Elmer) 
spectrophotometer with a PTP-6 automatic multicell temperature programmer. T m data was 
prepared using 0.5x SSPE buffer (Sigma, pH 7.4). Each ODN (luM of each strand) was 
mixed with its complement to give a 1:1 ratio. Prior to melting, samples were denatured at 
100°C and then cooled to 10°C over a 10 min period. Mismatch discrimination for each type 
of duplex was calculated in terms of AAG at 50°C using the equation: 
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AAG°5 0 = R°ln(K^ te h^n^ tch ). 

The term KnWKmismatch can be determined using the relative fractions of 
duplex and single strands calculated from the melting curves at 50°C (see Lohkov, S.G. & 
Pyshnyi, FEES Letters 420:134-138 (1997)). 

Example 11 

This example illustrates the hybridization of DNA to oligonucleotide glass 
microarrays prepared as described in co-pending application Ser. No. 09/364,320 and further 
optimized for 8-10 mer MGB probes. 

DNA (oligonucleotide or PCR amplicon) at 1-5 x 10" 7 M in 5XSSPE, 0.1% 
Trition X-100, 10% formamide was hybridized to the microarray in Frame-Seal Chambers 
(MJ Research) under the following conditions: 5 minutes at 55°C, slow cooling at 0.1 °C/sec 
to 35 °C, 60 minutes at 55°C. The slides were then washed in 0.5XSSPE for 5 minutes at 
45°C. Slides were dried under a stream of air and scanned using an Array Works Fluorescent 
scanner (Applied Precision). The washing procedure can be repeated under more stringent 
conditions if required. 

Example 12 

This example illustrates a single base primer extension assay. 

Single stranded template DNA and 6-mer primer (20|xM each), IX Thermopol 
buffer (New England Biolabs), lOmM manganese chloride (USB), were combined and 
incubated 5 minutes at 50°C. Then 5 \iCi oc 32 P ddATP and 8 units Bst polymerase (NEB) 
were added, bringing total volume to lOjul, and incubated 15 minutes at 50°C. After 
incubation, 6|il stop solution (95% formamide, 20mM EDTA) was added and reactions were 
cooled to room temperature. Samples were diluted 1 :10 in denaturing dye (35% formamide, 
0.05% xylene cyanol, 0.05% bromophenol blue, ImM EDTA), and aliquots were analyzed by 
electrophoresis on a denaturing 10% polyacrylamide gel. 
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The use of modified bases in primer extension assays for SNP typing is 
illustrated in Table 5. Primer extension with Bst polymerase (NEB) was evaluated with a 
primer substituted with different modified bases and the amount of product was measured 
based on the incorporated radioactivity using gel electrophoresis. 

5 



Table 5 

Comparison of 32 P-incorporation in primer extension product by polyacrylamide 
gel electrophoresis using AAC CAC TCT GTC CTA template 



Primer 


Relative Signal 


TGAGAC 


-H- 


U^GAGAC 


+ 


JjProPQprop AGAC 


+++ 


UPro PGP rop A prop GAC 


-H-H-+ 


jjpropQprop A propQjprop A Q 


! 1 1 II 


jJ>rop{y>roppj>ropQg>roppj>ropQ 


+ 


^jprop^y>rop pj>ropQprop p^propQprop 


0.5+ 



10 

Example 13 

This example illustrates the use of the algorithm described in the specification 
to predict the T m of modified oligonucleotides containing PPG both with and without a minor 

1 5 groove binder (CDPI 3 ) 

Using thermodynamic parameters in Tables 2a/2b and the correction formula 
for the MGBs contribution, T m of PPG oligonucleotides with and without a MGB can be 
calculated with algorithm procedures as described herein. The thermodynamic parameters of 
the nearest neighbor pairs that are not G-containing do not change when the oligonucleotides 

20 are substituted with PPG. Similarly the thermodynamic parameters of the nearest-neighbor 
pairs do not change when MGB containing oligonucleotides are substituted with PPG: 
Table 6 illustrates the ability of the algorithm to predict the T m of oligonucleotides containing 
PPG alone or in combination with a minor groove binder (CDPI3). In Table 6, all Gs are 
substituted with PPGs. 

25 
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Table 6 

Comparison of experimental T m s with that of predicted T m s using the nearest-neighbor 
thermodynamic parameters for PPG containing oligonucleotides and PPG containing 
oligonucleotides attached to a MGB 





ODN Duplex Stability °C 


MGB-ODN Duplex Stability 
°C 


S'-Probe Sequence -3' 




Tm ca ic 


Err 


1 MI SXD 


■ , "C3IC 


Err 


CTGTAAGTAGATATAAC 


51 84 


53 23 


1 39 


65.88 


66.69 


0.81 


GGCAAGA1A1AJ.AG 


50 21 


4Q 81 


-0.40 


66.37 


65.56 


-0.81 


GTGACGCAGATTCC 


fi1 97 


R1 OR 


-0 21 


76.97 


75.19 


-1 .78 


GTAAGTAGACATAAC 


52 12 


51 78 


-0.34 


64.64 


63.31 


-1.33 


CAGGGAGCTTTGGA 


59.9 


60.22 


0.32 


74.39 


71.47 


-2.92 


CACTCGTGAAGCTG 


60.85 


59.49 


-1.36 


74.04 


72.26 


-1.78 


G 1 AAG 1 AGGUA I AAt 


55.74 


55.47 


-0.27 


66.91 


66.00 


-0.91 


CGGGAIGIAGGAJLG 


57 52 


59 05 


1 53 


69.3 


70.03 


0.73 


GATTACCTGGATTT 


50.64 


50 32 


-0 32 


62.29 


62.33 


0.04 


CCGTCAATGGTCAC 


58 fifi 


fin m 


1 35 


70.13 


69.91 


-0.22 


CAGCACGTAGCC 


57 31 


58 07 


0 76 

V. 1 \I 


69.29 


67.60 


-1.69 


CGGCTACGTGCTGG 




fifi m 


0 82 


76 12 

f U. 1 €m 


74 79 


-1 .33 


CGGCTACATGCTGG 


R1 14 


81 Q5 


n 81 


71.56 


72.99 


1.43 


CTAAATCTGCCG 


3Uf*t 


48 HQ 


-2 31 


62.08 


60.19 


-1 .89 


TCTGGATGATGGGCA 


fi1 74 


R1 Q5 


0 21 


71.65 


72.13 


0.48 


GTTCATGGGTGTAAT 


57 51 


57 77 


n 26 


66.94 


68.79 


1.85 


CGGAGGTAGGATCA 


5Q 94 


5Q 4fi 


0 22 


69 46 


70 93 


1.47 


CCACCCGCCTCAG 


fiO 73 


R1 14 


0 41 


71 .43 


70.74 


-0.69 


CACAGGAGTGGTTGG 


r^i n7 


K4 40 


1 33 


72 28 


72 92 


0 64 


CGGACCAGTGCGTG 


fift 1 

DO. 1 


R7 58 
Of .uo 


-0 52 


77 92 


76.80 


-1.12 


TCGGACCAGTGCGT 


fi5 n4 


fifi nn 

DD.UU 


n Qfi 


74 94 


75 62 


0 68 


TV TV y"T y^l ^T fTl TV ^I^ITV m TV 

AACGGGGTACGATA 


57 


57 1 1 
Of . 1 1 


-0 89 


fi7 79 


fi7 08 


-0 71 


CAGTTGAGATTCTAAGAC 


fin nfi 


fin 15 


0 HQ 


67 15 


67 43 


0.28 


AGGGGCGTCTTG 


fin 7R 

QU. f O 


58 57 


-9 91 


71 62 


72.76 


1.14 


GTAAGTAGGCATAGC 


58.34 


58.95 


0.61 


65.95 


66.99 


1.04 


TGCCCAGCCCCAG 


63.13 


63.40 


0.27 


71.28 


71.32 


0.04 




54.87 


56.14 


1.27 


62.07 


63.54 


1.47 


GTAAGTAGACACAGC 


59.48 


58.41 


-1.07 


65.79 


66.27 


0.48 


TCGGACCAGTGC 


58.02 


58.55 


0.53 


65.99 


66.35 


0.36 


CGATCACGCTGGC 


62.12 


62.75 


0.63 


69.18 


71.81 


2.63 


GTCCTGGGGGTGG 


65.19 


64.54 


-0.65 


72.78 


72.53 


-0.25 


GTAAGTAGGTGTGAC 


60.7 


59.70 


-1.00 


66.92 


67.00 


0.08 I 


GGTTGTACGGGTTCACG 


68.38 


68.81 


0.43 


74.16 


75.38 


1.22 


GGACCAGTGCGTGA 


66.84 


65.46 


-1.38 


73.38 


71.53 


-1.85 


GTAAGTAGACGCAGC 


62.91 


62.44 


-0.47 


68 


I 67.82 


-0.18 


GTAAGTAGGCGCAGC 


65.52 


65.91 


0.39 


69.8 


70.34 


0.54 


GTAAGTAGGCGCGGC 


68.71 


68.96 


0.25 


72.26 


72.76 


0.50 


GGTTCCCGAGCG 


62.15 


61.14 


I -1.01 


65.75 


64.22 


-1.53 
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. The accuracy of the prediction algorithm is about +/- 1 and +/- 2°C for PPG 
containing oligonucleotides and PPG containing MGB-oligonucleotides, respectively. 
Accordingly, a sequence of interest can be obtained from a source such as Genbank, and a T m 
window can then be set as a requirement for a set of probes or primers. Using the algorithms 
5 described herein, and information from the nearest-neighbor parameters, a collection of probe 
or primer sequences with the desired T m s can be calculated. 

Alternatively, the algorithm can be used to select nearest-neighbor parameters 
from a selection of modified bases and to calculate the stability of more than one sequence of 
the same length to give the same predetermined thermodynamic stability, with the selected 

10 modified bases. 

Still further, the algorithm can be used to select one or more modified bases 

that allow the design of oligonucleotides of similar base pair length and substantially the 

same stability (T m ). These modified bases can be selected from a database containing pre- . 

determined thermodynamic nearest neighbor parameters from an collection of modified 
15 purines and/or modified pyrimidines. Preferably, the database contains nearest-neighbor 

parameters of 3-substituted pyrazolo[3,4-d]pyrimidines and/or 5-substituted pyrimidines. 

Example 14 



.TM 



20 This example illustrates the use of modified oligonucleotides in an Invader 

assay. 

The cleavase-based assay is shown in Figure 4 detects specific DNA and RNA 
sequences to cleave a complex formed by the hybridization of two overlapping 
oligonucleotides to a target. The enzyme cleaves the overhanging "flap", which serves as an 
25 invader probe in the detection cassette probe where cleavage releases a fluorescence signal. 
The first cleavage takes places only when the single base mismatch in the invader is a perfect 
match. No reaction takes place with a mismatch target B. The cleaved "flap" serves as an 
invader in the detection cassette, leading to the release of fluorescence in the second cleavage 
step. 
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Table 7a and 7b 

Comparison of Cleavase-bases amplification system with different modified bases substituted 

in the genomic and invader probes. 

5 

Table 7a 



# 


Probe 1 


Modified Bases Substituted in Probe Sequence 2 


Length 


1 


Cassette 


None 


41 


2 


Invader T 


None 


59 


3 


Invader Ml 


Five A24 bases 


32 


4 


Genomic T 


None 


43 


5 


Genomic Ml 


Six A14 bases 


35 


6 


Genomic M2 


Five A24 bases 


31 



1 Unmodified Probes and assay conditions are similar to those described by Hall et al, PNAS 
10 97:8272-77(2000) 

2 A14 is hydroxypropynylPPA and A24 is 3-iododiaminoPPA 

Table 7b 



Genomic probe 


Endpoint Match/Mismatch 
Signal Ratio 


AFRFU 3 


4 


5 


4100 


5 


10 


8000 


6 


7 


11900 



3 AF is end point fluorescence difference between match and 



1 5 mismatch in relative fluorescence units 

Table 7a shows a comparison of the effect of different modified bases when 
substituted in the Invader and genomic probes. The traditional invaderT (2) was substituted 
with 3-iododiaminoPPPA to give invader Ml which now has a length of 32 with a similar T m 
20 as the 59-mer (2). Similarly the traditional genomic probe (4) was substituted 3- 

hydroxypropynylPPPA and 3-iododiaminoPPPA to give (5) and (6) respectively. Both of 
these probes are substantially shorter than the traditional (4). Using invaderMl (3) in 
combination with the genomic probes 4-6 individually results in (Table 7b) improved 
performance of both genomic probes containing the modified bases (5 and 6) compared to the 
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traditional genomic probe (4). As shown both the endpoint match/mismatch signal ratios 
and the end point fluorescence show increases compared to the unmodified genomic probe. 

Software and Computer System Overview 

Figure 6 illustrates a general overview of a web-based oligonucleotide probe 
design system 10 including a server 50 and a client system 20 according to an embodiment of 
the present invention. In system 10, client system 20 is coupled through the Internet 40, or 
other communication network, e.g., over any LAN or WAN connection, to server system 50. 
As will be described herein, probe design system 10, in particular client system 20 and server 
system 50, is configured according to the present invention to define and create sequence 
detection probes in response to user input design criteria. It should be understood that, 
although only one client system 20 is shown and discussed herein, any number of client 
systems may be communicably coupled to server system 50. The probe design system of the 
present invention advantageously allows a user to design nucleotide sequences having 
optimal or near-optimal T m s and particularly nucleotide probes having one or more modified , 
bases as described herein and having optimal or near-optimal T m s. Generally, "probe" as 
used herein is intended to refer to a sequence detection probe, a PCR primer (left or right) or 
an amplicon (sequence from the 5' end of a left primer to the 5' end of a right primer). 

Several elements in the system shown in Figure 6 include conventional, well- 
known elements that need not be explained in detail here. For example, each client system 20 
could include a desktop personal computer, workstation, laptop, or any other computing 
device capable of interfacing directly or indirectly with server 50, e.g., over the Internet. 
Client system 20 typically runs a browsing program, such as Microsoft's Internet Explorer, 
Netscape Navigator, Opera or the like, allowing a user of client system 20 to access, process 
and view information and pages available to it from server system 50 or other server systems 
over Internet 40. Client system 20 also typically includes one or more user interface devices 
22, such as a keyboard, a mouse, touchscreen, pen or the like, for interacting with a graphical 
user interface (GUI) provided by the browser on a display 23 (e.g., monitor screen, LCD 
display, etc.), in conjunction with pages, forms and other information provided by server 
system 50 or other servers. The present invention is suitable for use with the Internet, which 
refers to a specific global Internetwork of networks. However, it should be understood that 
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other networks can be used instead of the Internet, such as an intranet, an extranet, a virtual 
private network (VPN), a non-TCP/IP based network, any LAN or WAN or the like. 

According to one embodiment, server system 50 and all of its components are 
operator configurable using an application including computer code run using a central 
processing unit 53 such as an Intel Pentium processor or the like. Computer code including 
instructions for operating and configuring server system 50 to process data content and 
communicate with client system 20 as described herein is preferably stored on a hard disk, 
but the entire program code, or portions thereof, may also be stored in any other volatile or 
non-volatile memory medium or device as is well known, such as a ROM or RAM, or 
provided on any media capable of storing program code, such as a compact disk (CD) 
medium, digital video disk (DVD) medium, a floppy disk, and the like. Additionally, the 
entire program code, or portions thereof, may be transmitted and downloaded from a software 
source, e.g., from server system 50 to client system 20 or from another server system or 
computing device to server system 50 over the Internet as is well known, or transmitted over 
any other conventional network connection (e.g., extranet, VPN, LAN, etc.) using any 
communication medium and protocols (e.g., TCP/IP, HTTP, HTTPS, Ethernet, etc.) as are 
well known. It should be understood that computer code for implementing aspects of the 
present invention can be implemented in a variety of coding languages such as C, C++, Java, 
Visual Basic, and others, or any scripting language, such as VBScript, JavaScript, Perl or 
markup languages such as XML, that can be executed on server system 50 and/or on a client 
system. In addition, a variety of languages can be used in the external and internal storage of 
data according to aspects of the present invention. 

According to one embodiment, an application (represented as module 55) 
executing on server system 50 includes instructions for designing probes based on user input 
criteria. Application 55 is preferably downloaded and stored in a hard drive 52 (or other 
memory such as a local or attached RAM or ROM), although application module 55 can be 
provided on any software storage medium such as a floppy disk, CD, DVD, etc. as discussed 
above. In one embodiment, application module 55 includes various software modules for 
processing data content, such as a user interface communication module 57 for 
communicating text and data frames to client system 20 through a data port, e.g., for 
rendering as active browser windows on client system 20, and for receiving data from client 
system 20. Further, user interface module 57 may also provide a browser application to 
client system 20. Application module 55 also includes a T m calculation module 59 including 
instructions to determine the T m of an oligonucleotide sequence using nearest neighbor 
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thermodynamic parameters as discussed herein. Values of nearest neighbor thermodynamic 
parameters, including, for example, those provided in Table 2 above, are preferably stored in 
memory 52, or other memory or storage system, for retrieval by T m calculation module 59. It 
should be appreciated that application module 55 and the appropriate data can be downloaded 

5 to and executed on client system 20. 

As used herein, the term "server system" will typically include one or more 
logically and/or physically connected servers distributed locally or across one or more remote 
geographic locations. Additionally, the term "server" will typically include a computer 
system and an associated storage system and database application as is well known in the art. 

10 In a preferred embodiment, the probe design application 55 is configured to 

design a probe system including a left and right primer and one or more sequence detection 
probes. Such probe systems are useful for Gene expression, DNA detection, SNP detection, 
etc. In this embodiment, the application first determines candidate probes and evaluates 
characteristics of the probes against minimum and maximum parameter values such as T m 

1 5 values, GC% values and longest mononucleotide run. In one embodiment, the user may 

selectively enter, or modify default values for, the minimum and maximum constraint values 
to narrow or expand the number of qualifying systems. Preferably, only probes that lie 
within the range of minimum and maximum values for all attributes qualify for further 
consideration and are scored and displayed. 

20 Once a probe is selected for further consideration, left and right primer 

windows are constructed based on the position of the probe. For each window, the process of 
candidate primer generation, calculation of primer-specific attribute values, and attribute 
evaluation follows that given above regarding probes. Next, the application generates all 
possible amplicons from the list of left and right primers. For L left primers and R right 

25 primers there are L*R candidate amplicons. Each candidate amplicon is evaluated with 
respect to minimum and maximum parameter values for attributes such as length and 
propensity for the left primer to anneal to the right primer. In addition, the position of the left 
primer is evaluated to make sure that it ends upstream of the probe, e.g., at least lObp 
upstream of the probe. The position of the right primer is evaluate to make sure it does not 

30 overlap the probe. Only qualifying amplicons that pass the minimum and maximum 

constraints for all parameters are considered further and are scored. Candidate systems are 
constructed from qualifying amplicons and probes. All possible remaining pair-wise 
comparisons are made to calculate and evaluate the tendency for primers and probes to anneal 
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to each other. Only qualifying systems that pass the minimum and maximum constraints for 
all parameters are scored and returned to the user. 

A score function is assigned to each qualifying probe, primer, amplicon and 
system in one embodiment. The score measures the distance of the qualifying probe, primer, 
amplicon or system from an ideal probe, primer, amplicon or system. An ideal primer, probe, 
amplicon or system is one that has an ideal value for all attributes. For example, an ideal 
primer has a T m of 68.0°C and a 50% GC content. In one embodiment, the scoring function 
has the form: 

E(abs(Ideal - Actual)*weight). 

This formula is described as the summation across attributes of the absolute 
value of the difference between the actual value calculated for a primer, probe, amplicon or 
system and its ideal value, multiplied by a weighting factor. The absolute value is taken so as 
to accumulate both positive and negative distances across all parameters. The user adjustable 
weighting factor is used to adjust the scale of different parameters as well as to place more 
importance on some parameters than on others. 

Figure 7 illustrates an example process flow according to one embodiment of 
the present invention. The user first calls up the application module 55 to begin the process, 
e.g., by logging into server 50, or executing application 55 locally. When the user first 
accesses the probe design application 55, a main window is displayed in step 100. The 
user/operator enters a target sequence or a polynucleotide sequence including a target 
sequence. The user may enter a sequence manually in step 105. For example a form is 
provided to the user with a space for entering one or more nucleotide sequences. 
Alternatively or additionally, the user may enter a GenBank GI number that identifies a 
particular nucleotide sequence. If a GenBank GI number is received, the system retrieves the 
identified nucleotide sequence from a GenBank server 60 (Figure 6) over the Internet or other 
network in step 1 10. The system logs into the GenBank server, provides the GenBank GI 
number and the GenBank server provides the identified nucleotide sequence. 

After a sequence is obtained, the application identifies one or more possible 
nucleotide sequences based on the received sequence. For example, in one embodiment, the 
application generates a list of one or more candidate sequences based on the received 
sequence. The candidate sequences may include, or be derived from, the entire received 
sequence or portions of the received sequence or sequences complementary to portions of the 
received sequence, e.g., probes and primers. For example, if the user selects a general probe 
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design mode in step 1 15, the application determines appropriate probe systems including 
candidate probes, primers and amplicons based on the received sequence as discussed above. 
In step 120, the received sequence is examined to determine whether a valid sequence was 
entered. If there is an error, an error message is returned in step 125 and the process returns 
to the main page. If the sequence is valid, a first probe (e.g., probe, primer or amplicon, 
depending on stage of process) is determined in step 130. 

In step 135, the T m of that probe is calculated by T m calculation module 59. 
T m calculation module 59 preferably determines the T m of a nucleotide sequence according to 
the teachings of the present invention. In particular, T m calculation module 59 predicts the 
T m for each candidate nucleotide sequence using nearest neighbor thermodynamic parameters 
as discussed above. In preferred aspects, at least some of the candidate sequences include at 
least one or more modified bases and optionally an MGB as described herein. T m calculation 
module 59 retrieves nearest neighbor parameters from memory 52 and returns a T m for the 
given sequence. In step 140, it is determined whether the calculated T m is within a suitable 
range. If not, the process discards the probe and proceeds to the next probe. If within a 
suitable range, the process proceeds to store a record of the test probe sequence and T m to 
memory in step 155, e.g., in a table. Optional steps 145 and 150 are provided to determine 
other characteristics of the test probe, such as, for example, the number of self complements 
and the number of hairpins. The process repeats steps 130 to 155 for each test probe (e.g., 
probe, primer, amplicon). At step 160, after all test probes have been analyzed, the probes 
are displayed in a table, preferably including the T m values as well as other pertinent 
characteristic data. 

For a specific probe or primer, T m is the predicted temperature at which half of 
the primers or probes are in the single stranded state and half are in the double stranded state. 
Probe and primer T m is calculated using the nearest neighbor model as described above. In 
one embodiment, the calculation can be generalized as: 

T m = dH/(dS + 1.987*Log(C/2)) - 273.15, 

where entropy (dH) and enthalpy (dS) are accumulated across the 
oligonucleotide and where C is the oligo concentration. This formula proposes that - 
Cprobe^Ctarget (complement), dH-~»dH° and dS-»dS°. Where a MGB is used, a MGB 
adjustment (MGB_dds) is included as follows: 

T m - dH/(dS + MGB_dds + 1 ,987*Log(C/2)) - 273.15. 
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A Single Site polymorphism mode may also be selected, as shown in Figure 7. 
Once a corresponding nucleotide sequence has been identified or obtained, the process 
essentially repeats the process for the general probe design mode (e.g., steps 120-160) as 
discussed above. In some embodiments, mutant T m s probes are analyzed in step 185. For 

5 example, one may design a probe to a mismatch, e.g., ATT GCG [T/G]CT ACT , 

where the mismatch is indicated in brackets. The algorithm designs a complementary probe 

against the 'Wild-type", in this case ATT GCG [T ]CT ACT It also designs a 

complementary probe against the mutant sequence ATT GCG [G ]CT ACT In 

some cases the mutant sequence contains more than one mismatch in the probe region. 

10 All the nearest-neighbor parameters are preferably stored in the Test parameter 

data base, e.g., in memory. If the program moves to, for example, the Use modified A T or 
G, it retrieves from the data base the appropriate nearest neighbor parameters. 

In one embodiment, the Tm of a nucleotide sequence, or a portion thereof, as 
entered by the user is calculated in step 1 80 using the teachings of the present invention or 

1 5 other known techniques. 

All publications and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. Although the 
20 foregoing invention has been described in some detail by way of illustration and example for 
purposes of clarity of understanding, it will be readily apparent to those of ordinary skill in 
the art in light of the teachings of this invention that certain changes and modifications may 
be made thereto without departing from the spirit or scope of the appended claims. 
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WHAT IS CLAIMED IS: 

1 LA computer-implemented method of predicting the melting 

2 temperature (T m ) of an oligonucleotide sequence, comprising: 

3 identifying an oligonucleotide having a sequence of N bases; and 

4 calculating a melting temperature (T m ) of the identified oligonucleotide 

5 sequence using the nearest neighbor thermodynamic parameters for each of the N-l 

6 neighboring base pairs in the identified sequence, each nearest neighbor thermodynamic 

7 parameter defining a thermodynamic contribution of two corresponding neighboring bases. 

1 2. The computer-implemented method of claim 1 , wherein identifying 

2 includes receiving a polynucleotide sequence from Genbank. 

1 3 . The computer-implemented method of claim 1 , wherein identifying 

2 includes receiving a user input polynucleotide sequence. 

1 4. The computer-implemented method of claim 1, wherein identifying 

2 includes receiving a polynucleotide sequence and identifying one of a portion of the received 

3 sequence and a sequence complementary to a portion of the received sequence. = 

1 5. The computer-implemented method of claim 1 , wherein said identified 

2 sequence includes at least one modified base. 

1 6. The computer-implemented method of claim 5, wherein said at least 

2 one modified base includes a member selected from the group consisting of a base attached to 

3 an amino acid, a locked nucleic acid sugar and a universal base. 

1 7. The computer-implemented method of claim 5, wherein said at least 

2 one modified base includes a member selected from the group consisting of PPA, PPG, 

3 PPPA, PPPG, PU, PC, HOPU, HOBuU, HOBuC, HOBUPPA, (NH 2 ) 2 PPPA, 

4 (NH 2 ) 2 PPPAOH, (NH 2 ) 2 BuPPAOH, (NH 2 ) 2 PPAI, (NH 2 ) 2 PPABr, (NH 2 ) 2 PPAC1, PPAI, 

5 PPABr, PPAC1 and HOBuPPG. 

1 8. The computer-implemented method of claim 5, wherein said identified 

2 oligonucleotide sequence has a minor groove binder (MGB) attached thereto. 

1 9. The computer-implemented method of claim 8, wherein said MGB has 

2 a formula selected from the group consisting of: 
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3 




4 wherein 

5 the subscript m is an integer of from 2 to 5 ; 

6 the subscript r is an integer of from 2 to 10; and 

7 each R a and R b is independently a linking group to said modified oligonucleotide, H, 

8 -OR c , -NR c R d , ~COOR c and -CONR c R d wherein each R c and R d is selected from the group 

9 consisting of H, (Ci-Ci2)heteroalkyl, (C2-Ci2)heteroalkenyl, (C 2 -Ci2)heteroalkynyl, (d- 
10 Ci 2 )alkyl, (C 2 -Ci 2 )alkenyl, (C 2 -Ci 2 )alkynyl, aryl(CrCi 2 )alkyl and aryl. 

1 10. The computer-implemented method of claim 1 , further comprising: 

2 retrieving said nearest neighbor thermodynamic parameters for each of the N- 

3 1 neighboring base pairs in the identified sequence from a memory. 

1 1 1 . A computer readable medium including computer code for controlling 

2 a processor to calculate the melting temperature (T™) of an oligonucleotide, the computer 

3 code including instructions to: 

4 identify an oligonucleotide sequence responsive to user input, said sequence 

5 having N bases; 

6 retrieve nearest neighbor thermodynamic parameters for each of the N- 1 

7 neighboring base pairs in the identified sequence from a memory, each nearest neighbor 

8 thermodynamic parameter defining a thermodynamic contribution of two corresponding 

9 neighboring bases; and 

1 0 calculate a melting temperature (T m ) of the identified oligonucleotide 

1 1 sequence using the retrieved nearest neighbor thermodynamic parameters. 

1 12. The computer-readable medium of claim 11, wherein said user input 

2 includes a Genbank code, wherein said instructions to identify include instructions to 

3 automatically retrieve a polynucleotide sequence from Genbank over a network using said 

4 input Genbank code. 

1 13. The computer-readable medium of claim 1 1 , wherein said user input 

2 includes a polynucleotide sequence. 
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1 14. The computer-readable medium of claim 11, wherein said instructions 

2 to identify includes instructions to identify one of a portion of a received polynucleotide 

3 sequence and a sequence complementary to a portion of the received sequence. 

1 15. The computer-readable medium of claim 1 1, wherein said identified 

2 sequence includes at least one modified base. 

1 16. The computer-readable medium of claim 15, wherein said at least one 

2 modified base includes a member selected from the group consisting of a base attached to an 

3 amino acid, a locked nucleic acid sugar and a universal base. 

1 17. The computer-readable medium of claim 15, wherein said at least one 

2 modified base includes a member selected from the group consisting of PPA, PPG, PPPA, 

3 PPPG, PU, PC, HOPU, HOBuU, HOBuC, HOBUPPA, (NH 2 ) 2 PPPA, (NH 2 ) 2 PPPAOH, 

4 (NH 2 ) 2 BuPPAOH, (NH 2 ) 2 PPAI, (NH 2 ) 2 PPABr, (NH 2 ) 2 PPAC1, PPAI, PPABr, PPAC1, and 

5 HOBuPPG. 



1 18. The computer-readable medium of claim 1 5 , wherein said identified 

2 oligonucleotide sequence has a minor groove binder (MGB) attached thereto. 

1 19. The computer-readable medium of claim 18, wherein said MGB has a 

2 formula selected from the group consisting of: 



3 




4 wherein 

5 the subscript m is an integer of from 2 to 5; 

6 the subscript r is an integer of from 2 to 10; and 

7 each R a and R b is independently a linking group to said modified 

8 oligonucleotide, H, -OR c , -NR c R d , -COOR c and -CONR c R d wherein each R c and R d is 

9 selected from the group consisting of H, (Ci-Ci 2 )heteroalkyl, (C 2 -C] 2 )heteroalkenyl, (C 2 - 

10 Ci 2 )heteroalkynyl, (C r Ci 2 )alkyl, (C 2 -Ci 2 )alkenyl, (C 2 -C, 2 )alkynyl, aryl(Ci-Ci 2 )alkyl and 

11 aryl. 
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1 20. A computer system for predicting the melting temperature (T m ) of an 

2 oligonucleotide sequence, comprising: 

3 a memory unit for storing nearest neighbor thermodynamic parameters; 

4 means for identifying a nucleotide sequence having N bases, wherein at least 

5 one of said N bases includes a modified base; and 

6 processing means, coupled to the memory unit, for retrieving nearest neighbor 



7 parameters for each of the N-l neighboring base pairs in the identified sequence from the 

8 memory unit, and for calculating a melting temperature (T m ) of the identified oligonucleotide 

9 sequence using the nearest neighbor thermodynamic parameters, each nearest neighbor 

10 thermodynamic parameter defining a thermodynamic contribution of two corresponding 

11 neighboring bases. 



1 21 . The system of claim 20, wherein the means for identifying includes 

2 means for receiving a user entered nucleotide sequence. 

1 22. The system of claim 20, wherein the means for identifying includes 

2 means for receiving a GenBank GI number specifying a nucleotide sequence stored at a 

3 GenBank server, the system further including means for retrieving the specified sequence 

4 from a GenBank server. 

1 23 . The system of claim 20, wherein the means for identifying includes 

2 means for receiving a polynucleotide sequence having N or greater bases and identifying one 

3 of a portion of the received sequence and a sequence complementary to a portion of the 

4 received sequence. 

1 24. The computer-implemented method of claim 5, wherein said identified 

2 oligonucleotide sequence has attached to it one or more of a minor groove binder (MGB), a 

3 fluorophore (Fl) and a quencher (Q). 

1 25 . The computer-implemented method of claim 9, where the minor 

2 groove binder is attached to the oligonucleotide via a quencher molecule. 
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